Machine Learning最新文献_第10页

Optimal clustering from noisy binary feedback 从噪声二进制反馈中优化聚类

IF 7.5 3区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Machine Learning

Pub Date : 2024-03-22 DOI: 10.1007/s10994-024-06532-z

Abstract

We study the problem of clustering a set of items from binary user feedback. Such a problem arises in crowdsourcing platforms solving large-scale labeling tasks with minimal effort put on the users. For example, in some of the recent reCAPTCHA systems, users clicks (binary answers) can be used to efficiently label images. In our inference problem, items are grouped into initially unknown non-overlapping clusters. To recover these clusters, the learner sequentially presents to users a finite list of items together with a question with a binary answer selected from a fixed finite set. For each of these items, the user provides a noisy answer whose expectation is determined by the item cluster and the question and by an item-specific parameter characterizing the hardness of classifying the item. The objective is to devise an algorithm with a minimal cluster recovery error rate. We derive problem-specific information-theoretical lower bounds on the error rate satisfied by any algorithm, for both uniform and adaptive (list, question) selection strategies. For uniform selection, we present a simple algorithm built upon the K-means algorithm and whose performance almost matches the fundamental limits. For adaptive selection, we develop an adaptive algorithm that is inspired by the derivation of the information-theoretical error lower bounds, and in turn allocates the budget in an efficient way. The algorithm learns to select items hard to cluster and relevant questions more often. We compare the performance of our algorithms with or without the adaptive selection strategy numerically and illustrate the gain achieved by being adaptive.

摘要我们研究了根据二进制用户反馈对一组项目进行聚类的问题。这种问题出现在众包平台上，用户只需付出最小的努力就能解决大规模的标记任务。例如，在最近的一些 reCAPTCHA 系统中，用户的点击（二进制答案）可以用来有效地标记图像。在我们的推理问题中，项目被归入最初未知的非重叠群组。为了恢复这些群集，学习者会依次向用户展示一个有限的项目列表，以及一个从固定的有限集合中选出的带有二进制答案的问题。对于每个项目，用户都会提供一个噪声答案，其期望值由项目群和问题以及表征项目分类难易程度的特定项目参数决定。我们的目标是设计一种具有最小群组恢复错误率的算法。我们针对统一和自适应（列表、问题）选择策略，推导出了任何算法所满足的错误率的特定问题信息理论下限。对于统一选择，我们提出了一种建立在 K-means 算法基础上的简单算法，其性能几乎与基本限制相匹配。对于自适应选择，我们开发了一种自适应算法，该算法受到信息论误差下限推导的启发，进而以一种有效的方式分配预算。该算法学会更频繁地选择难以聚类的项目和相关问题。我们用数字比较了有无自适应选择策略的算法性能，并说明了自适应所带来的收益。

{"title":"Optimal clustering from noisy binary feedback","authors":"","doi":"10.1007/s10994-024-06532-z","DOIUrl":"https://doi.org/10.1007/s10994-024-06532-z","url":null,"abstract":"<h3>Abstract</h3> We study the problem of clustering a set of items from binary user feedback. Such a problem arises in crowdsourcing platforms solving large-scale labeling tasks with minimal effort put on the users. For example, in some of the recent reCAPTCHA systems, users clicks (binary answers) can be used to efficiently label images. In our inference problem, items are grouped into initially unknown non-overlapping clusters. To recover these clusters, the learner sequentially presents to users a finite list of items together with a question with a binary answer selected from a fixed finite set. For each of these items, the user provides a noisy answer whose expectation is determined by the item cluster and the question and by an item-specific parameter characterizing the hardness of classifying the item. The objective is to devise an algorithm with a minimal cluster recovery error rate. We derive problem-specific information-theoretical lower bounds on the error rate satisfied by any algorithm, for both uniform and adaptive (list, question) selection strategies. For uniform selection, we present a simple algorithm built upon the K-means algorithm and whose performance almost matches the fundamental limits. For adaptive selection, we develop an adaptive algorithm that is inspired by the derivation of the information-theoretical error lower bounds, and in turn allocates the budget in an efficient way. The algorithm learns to select items hard to cluster and relevant questions more often. We compare the performance of our algorithms with or without the adaptive selection strategy numerically and illustrate the gain achieved by being adaptive.","PeriodicalId":49900,"journal":{"name":"Machine Learning","volume":"24 1","pages":""},"PeriodicalIF":7.5,"publicationDate":"2024-03-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140204495","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

TOCOL: improving contextual representation of pre-trained language models via token-level contrastive learning TOCOL：通过标记级对比学习改进预训练语言模型的语境表征

IF 7.5 3区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Machine Learning

Pub Date : 2024-03-18 DOI: 10.1007/s10994-023-06512-9

Keheng Wang, Chuantao Yin, Rumei Li, Sirui Wang, Yunsen Xian, Wenge Rong, Zhang Xiong

Self-attention, which allows transformers to capture deep bidirectional contexts, plays a vital role in BERT-like pre-trained language models. However, the maximum likelihood pre-training objective of BERT may produce an anisotropic word embedding space, which leads to biased attention scores for high-frequency tokens, as they are very close to each other in representation space and thus have higher similarities. This bias may ultimately affect the encoding of global contextual information. To address this issue, we propose TOCOL, a TOken-Level COntrastive Learning framework for improving the contextual representation of pre-trained language models, which integrates a novel self-supervised objective to the attention mechanism to reshape the word representation space and encourages PLM to capture the global semantics of sentences. Results on the GLUE Benchmark show that TOCOL brings considerable improvement over the original BERT. Furthermore, we conduct a detailed analysis and demonstrate the robustness of our approach for low-resource scenarios.

自我关注允许转换器捕捉深度双向语境，在类似 BERT 的预训练语言模型中发挥着重要作用。然而，BERT 的最大似然预训练目标可能会产生一个各向异性的单词嵌入空间，从而导致高频词块的注意力得分出现偏差，因为这些词块在表征空间中彼此非常接近，因此具有较高的相似性。这种偏差最终可能会影响全局语境信息的编码。为了解决这个问题，我们提出了 TOCOL（一种用于改进预训练语言模型上下文表征的词级对比学习框架），它将一种新颖的自监督目标整合到注意力机制中，以重塑词的表征空间，并鼓励 PLM 捕捉句子的全局语义。GLUE 基准测试结果表明，TOCOL 比原来的 BERT 有了很大的改进。此外，我们还进行了详细分析，证明了我们的方法在低资源场景下的鲁棒性。

{"title":"TOCOL: improving contextual representation of pre-trained language models via token-level contrastive learning","authors":"Keheng Wang, Chuantao Yin, Rumei Li, Sirui Wang, Yunsen Xian, Wenge Rong, Zhang Xiong","doi":"10.1007/s10994-023-06512-9","DOIUrl":"https://doi.org/10.1007/s10994-023-06512-9","url":null,"abstract":"Self-attention, which allows transformers to capture deep bidirectional contexts, plays a vital role in BERT-like pre-trained language models. However, the maximum likelihood pre-training objective of BERT may produce an anisotropic word embedding space, which leads to biased attention scores for high-frequency tokens, as they are very close to each other in representation space and thus have higher similarities. This bias may ultimately affect the encoding of global contextual information. To address this issue, we propose TOCOL, a TOken-Level COntrastive Learning framework for improving the contextual representation of pre-trained language models, which integrates a novel self-supervised objective to the attention mechanism to reshape the word representation space and encourages PLM to capture the global semantics of sentences. Results on the GLUE Benchmark show that TOCOL brings considerable improvement over the original BERT. Furthermore, we conduct a detailed analysis and demonstrate the robustness of our approach for low-resource scenarios.","PeriodicalId":49900,"journal":{"name":"Machine Learning","volume":"9 1","pages":""},"PeriodicalIF":7.5,"publicationDate":"2024-03-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140168796","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Stress detection with encoding physiological signals and convolutional neural network 利用生理信号编码和卷积神经网络进行压力检测

IF 7.5 3区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Machine Learning

Pub Date : 2024-03-15 DOI: 10.1007/s10994-023-06509-4

Michela Quadrini, Antonino Capuccio, Denise Falcone, Sebastian Daberdaku, Alessandro Blanda, Luca Bellanova, Gianluca Gerard

Stress is a significant and growing phenomenon in the modern world that leads to numerous health problems. Robust and non-invasive method developments for early and accurate stress detection are crucial in enhancing people’s quality of life. Previous researches show that using machine learning approaches on physiological signals is a reliable stress predictor by achieving significant results. However, it requires determining features by hand. Such a selection is a challenge in this context since stress determines nonspecific human responses. This work overcomes such limitations by considering STREDWES, an approach for Stress Detection from Wearable Sensors Data. STREDWES encodes signal fragments of physiological signals into images and classifies them by a Convolutional Neural Network (CNN). This study aims to study several encoding methods, including the Gramian Angular Summation/Difference Field method and Markov Transition Field, to evaluate the best way to encode signals into images in this domain. Such a study is performed on the NEURO dataset. Moreover, we investigate the usefulness of STREDWES in real scenarios by considering the SWELL dataset and a personalized approach. Finally, we compare the proposed approach with its competitors by considering the WESAD dataset. It outperforms the others.

压力是现代社会中一个重要且日益增长的现象，会导致许多健康问题。开发可靠的非侵入性方法，用于早期准确检测压力，对于提高人们的生活质量至关重要。以往的研究表明，在生理信号上使用机器学习方法是一种可靠的压力预测方法，能取得显著效果。然而，这需要人工确定特征。在这种情况下，这种选择是一个挑战，因为压力决定了人类的非特异性反应。STREDWES 是一种从可穿戴传感器数据中进行压力检测的方法，这项研究通过考虑 STREDWES 克服了上述局限性。STREDWES 将生理信号的信号片段编码成图像，并通过卷积神经网络（CNN）对其进行分类。本研究旨在研究几种编码方法，包括格拉米安角相加/差分场法和马尔可夫转换场法，以评估在该领域将信号编码成图像的最佳方法。这项研究是在 NEURO 数据集上进行的。此外，我们还通过考虑 SWELL 数据集和个性化方法，研究了 STREDWES 在实际场景中的实用性。最后，我们通过 WESAD 数据集将所提出的方法与其竞争对手进行了比较。结果显示，该方法优于其他方法。

{"title":"Stress detection with encoding physiological signals and convolutional neural network","authors":"Michela Quadrini, Antonino Capuccio, Denise Falcone, Sebastian Daberdaku, Alessandro Blanda, Luca Bellanova, Gianluca Gerard","doi":"10.1007/s10994-023-06509-4","DOIUrl":"https://doi.org/10.1007/s10994-023-06509-4","url":null,"abstract":"Stress is a significant and growing phenomenon in the modern world that leads to numerous health problems. Robust and non-invasive method developments for early and accurate stress detection are crucial in enhancing people’s quality of life. Previous researches show that using machine learning approaches on physiological signals is a reliable stress predictor by achieving significant results. However, it requires determining features by hand. Such a selection is a challenge in this context since stress determines nonspecific human responses. This work overcomes such limitations by considering STREDWES, an approach for Stress Detection from Wearable Sensors Data. STREDWES encodes signal fragments of physiological signals into images and classifies them by a Convolutional Neural Network (CNN). This study aims to study several encoding methods, including the Gramian Angular Summation/Difference Field method and Markov Transition Field, to evaluate the best way to encode signals into images in this domain. Such a study is performed on the NEURO dataset. Moreover, we investigate the usefulness of STREDWES in real scenarios by considering the SWELL dataset and a personalized approach. Finally, we compare the proposed approach with its competitors by considering the WESAD dataset. It outperforms the others.","PeriodicalId":49900,"journal":{"name":"Machine Learning","volume":"8 1","pages":""},"PeriodicalIF":7.5,"publicationDate":"2024-03-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140152360","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Glacier: guided locally constrained counterfactual explanations for time series classification 冰川：引导时间序列分类的局部约束反事实解释

IF 7.5 3区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Machine Learning

Pub Date : 2024-03-13 DOI: 10.1007/s10994-023-06502-x

Zhendong Wang, Isak Samsten, Ioanna Miliou, Rami Mochaourab, Panagiotis Papapetrou

In machine learning applications, there is a need to obtain predictive models of high performance and, most importantly, to allow end-users and practitioners to understand and act on their predictions. One way to obtain such understanding is via counterfactuals, that provide sample-based explanations in the form of recommendations on which features need to be modified from a test example so that the classification outcome of a given classifier changes from an undesired outcome to a desired one. This paper focuses on the domain of time series classification, more specifically, on defining counterfactual explanations for univariate time series. We propose Glacier, a model-agnostic method for generating locally-constrained counterfactual explanations for time series classification using gradient search either on the original space or on a latent space that is learned through an auto-encoder. An additional flexibility of our method is the inclusion of constraints on the counterfactual generation process that favour applying changes to particular time series points or segments while discouraging changing others. The main purpose of these constraints is to ensure more reliable counterfactuals, while increasing the efficiency of the counterfactual generation process. Two particular types of constraints are considered, i.e., example-specific constraints and global constraints. We conduct extensive experiments on 40 datasets from the UCR archive, comparing different instantiations of Glacier against three competitors. Our findings suggest that Glacier outperforms the three competitors in terms of two common metrics for counterfactuals, i.e., proximity and compactness. Moreover, Glacier obtains comparable counterfactual validity compared to the best of the three competitors. Finally, when comparing the unconstrained variant of Glacier to the constraint-based variants, we conclude that the inclusion of example-specific and global constraints yields a good performance while demonstrating the trade-off between the different metrics.

在机器学习应用中，需要获得高性能的预测模型，最重要的是，要让最终用户和从业人员能够理解其预测结果，并根据预测结果采取行动。反事实是获得这种理解的一种方法，它以建议的形式提供基于样本的解释，说明需要从测试示例中修改哪些特征，从而使给定分类器的分类结果从不尽人意变为理想结果。本文的重点是时间序列分类领域，更具体地说，是定义单变量时间序列的反事实解释。我们提出了一种与模型无关的方法--Glacier，这种方法可以在原始空间或通过自动编码器学习的潜在空间上使用梯度搜索，为时间序列分类生成局部受限的反事实解释。我们的方法还具有额外的灵活性，即在反事实生成过程中加入了一些约束条件，这些约束条件有利于对特定的时间序列点或片段进行更改，而不鼓励更改其他点或片段。这些约束的主要目的是确保更可靠的反事实，同时提高反事实生成过程的效率。我们考虑了两种特殊类型的约束，即特定实例约束和全局约束。我们在 UCR 档案中的 40 个数据集上进行了广泛的实验，将 Glacier 的不同实例与三个竞争对手进行了比较。我们的研究结果表明，Glacier 在反事实的两个通用指标（即接近性和紧凑性）方面优于三个竞争对手。此外，Glacier 还获得了与三位竞争者中最好的一位相当的反事实有效性。最后，在比较 Glacier 的无约束变体和基于约束的变体时，我们得出结论：包含特定实例约束和全局约束会产生良好的性能，同时证明了不同指标之间的权衡。

{"title":"Glacier: guided locally constrained counterfactual explanations for time series classification","authors":"Zhendong Wang, Isak Samsten, Ioanna Miliou, Rami Mochaourab, Panagiotis Papapetrou","doi":"10.1007/s10994-023-06502-x","DOIUrl":"https://doi.org/10.1007/s10994-023-06502-x","url":null,"abstract":"In machine learning applications, there is a need to obtain predictive models of high performance and, most importantly, to allow end-users and practitioners to understand and act on their predictions. One way to obtain such understanding is via counterfactuals, that provide sample-based explanations in the form of recommendations on which features need to be modified from a test example so that the classification outcome of a given classifier changes from an undesired outcome to a desired one. This paper focuses on the domain of time series classification, more specifically, on defining counterfactual explanations for univariate time series. We propose Glacier, a model-agnostic method for generating locally-constrained counterfactual explanations for time series classification using gradient search either on the original space or on a latent space that is learned through an auto-encoder. An additional flexibility of our method is the inclusion of constraints on the counterfactual generation process that favour applying changes to particular time series points or segments while discouraging changing others. The main purpose of these constraints is to ensure more reliable counterfactuals, while increasing the efficiency of the counterfactual generation process. Two particular types of constraints are considered, i.e., example-specific constraints and global constraints. We conduct extensive experiments on 40 datasets from the UCR archive, comparing different instantiations of Glacier against three competitors. Our findings suggest that Glacier outperforms the three competitors in terms of two common metrics for counterfactuals, i.e., proximity and compactness. Moreover, Glacier obtains comparable counterfactual validity compared to the best of the three competitors. Finally, when comparing the unconstrained variant of Glacier to the constraint-based variants, we conclude that the inclusion of example-specific and global constraints yields a good performance while demonstrating the trade-off between the different metrics.","PeriodicalId":49900,"journal":{"name":"Machine Learning","volume":"24 1","pages":""},"PeriodicalIF":7.5,"publicationDate":"2024-03-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140125822","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Neural network relief: a pruning algorithm based on neural activity 神经网络救济：基于神经活动的剪枝算法

IF 7.5 3区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Machine Learning

Pub Date : 2024-03-05 DOI: 10.1007/s10994-024-06516-z

Abstract

Current deep neural networks (DNNs) are overparameterized and use most of their neuronal connections during inference for each task. The human brain, however, developed specialized regions for different tasks and performs inference with a small fraction of its neuronal connections. We propose an iterative pruning strategy introducing a simple importance-score metric that deactivates unimportant connections, tackling overparameterization in DNNs and modulating the firing patterns. The aim is to find the smallest number of connections that is still capable of solving a given task with comparable accuracy, i.e. a simpler subnetwork. We achieve comparable performance for LeNet architectures on MNIST, and significantly higher parameter compression than state-of-the-art algorithms for VGG and ResNet architectures on CIFAR-10/100 and Tiny-ImageNet. Our approach also performs well for the two different optimizers considered—Adam and SGD. The algorithm is not designed to minimize FLOPs when considering current hardware and software implementations, although it performs reasonably when compared to the state of the art.

摘要当前的深度神经网络（DNN）参数过高，在推理每个任务时都会使用大部分神经元连接。然而，人类大脑为不同的任务开发了专门的区域，并使用其神经元连接的一小部分执行推理。我们提出了一种迭代剪枝策略，引入了一个简单的重要性分数指标，停用不重要的连接，解决 DNN 中的过度参数化问题，并调节发射模式。这样做的目的是找到最小数量的连接，这些连接仍能以相当的精度解决给定任务，即一个更简单的子网络。我们在 MNIST 上实现了与 LeNet 架构相当的性能，在 CIFAR-10/100 和 Tiny-ImageNet 上实现了比 VGG 和 ResNet 架构先进算法更高的参数压缩率。我们的方法在两种不同的优化器--Adam 和 SGD 中也表现出色。考虑到当前的硬件和软件实现，该算法并不是为了最小化 FLOPs 而设计的，不过与最先进的算法相比，它的表现还算合理。

引用次数: 0

Tackle balancing constraints in semi-supervised ordinal regression 解决半监督序数回归中的平衡约束问题

IF 7.5 3区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Machine Learning

Pub Date : 2024-03-04 DOI: 10.1007/s10994-024-06518-x

Chenkang Zhang, Heng Huang, Bin Gu

Semi-supervised ordinal regression (S²OR) has been recognized as a valuable technique to improve the performance of the ordinal regression (OR) model by leveraging available unlabeled samples. The balancing constraint is a useful approach for semi-supervised algorithms, as it can prevent the trivial solution of classifying a large number of unlabeled examples into a few classes. However, rapid training of the S²OR model with balancing constraints is still an open problem due to the difficulty in formulating and solving the corresponding optimization objective. To tackle this issue, we propose a novel form of balancing constraints and extend the traditional convex–concave procedure (CCCP) approach to solve our objective function. Additionally, we transform the convex inner loop (CIL) problem generated by the CCCP approach into a quadratic problem that resembles support vector machine, where multiple equality constraints are treated as virtual samples. As a result, we can utilize the existing fast solver to efficiently solve the CIL problem. Experimental results conducted on several benchmark and real-world datasets not only validate the effectiveness of our proposed algorithm but also demonstrate its superior performance compared to other supervised and semi-supervised algorithms

半监督序数回归（S2OR）已被公认为是一种有价值的技术，可以利用现有的未标记样本提高序数回归（OR）模型的性能。对于半监督算法来说，平衡约束是一种有用的方法，因为它可以避免将大量未标记样本归入少数几个类别的琐碎解决方案。然而，由于难以制定和求解相应的优化目标，快速训练具有平衡约束的 S2OR 模型仍是一个未决问题。为了解决这个问题，我们提出了一种新的平衡约束形式，并扩展了传统的凸-凹过程（CCCP）方法来求解我们的目标函数。此外，我们将 CCCP 方法生成的凸内循环 (CIL) 问题转化为类似于支持向量机的二次问题，其中多个相等约束被视为虚拟样本。因此，我们可以利用现有的快速求解器来高效解决 CIL 问题。在多个基准数据集和现实世界数据集上进行的实验结果不仅验证了我们提出的算法的有效性，还证明了它与其他监督和半监督算法相比的卓越性能

{"title":"Tackle balancing constraints in semi-supervised ordinal regression","authors":"Chenkang Zhang, Heng Huang, Bin Gu","doi":"10.1007/s10994-024-06518-x","DOIUrl":"https://doi.org/10.1007/s10994-024-06518-x","url":null,"abstract":"Semi-supervised ordinal regression (S2OR) has been recognized as a valuable technique to improve the performance of the ordinal regression (OR) model by leveraging available unlabeled samples. The balancing constraint is a useful approach for semi-supervised algorithms, as it can prevent the trivial solution of classifying a large number of unlabeled examples into a few classes. However, rapid training of the S2OR model with balancing constraints is still an open problem due to the difficulty in formulating and solving the corresponding optimization objective. To tackle this issue, we propose a novel form of balancing constraints and extend the traditional convex–concave procedure (CCCP) approach to solve our objective function. Additionally, we transform the convex inner loop (CIL) problem generated by the CCCP approach into a quadratic problem that resembles support vector machine, where multiple equality constraints are treated as virtual samples. As a result, we can utilize the existing fast solver to efficiently solve the CIL problem. Experimental results conducted on several benchmark and real-world datasets not only validate the effectiveness of our proposed algorithm but also demonstrate its superior performance compared to other supervised and semi-supervised algorithms","PeriodicalId":49900,"journal":{"name":"Machine Learning","volume":"11 1","pages":""},"PeriodicalIF":7.5,"publicationDate":"2024-03-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140035943","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

An encoding approach for stable change point detection 稳定变化点检测编码方法

IF 7.5 3区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Machine Learning

Pub Date : 2024-02-28 DOI: 10.1007/s10994-023-06510-x

Xiaodong Wang, Fushing Hsieh

Without imposing prior distributional knowledge underlying multivariate time series of interest, we propose a nonparametric change-point detection approach to estimate the number of change points and their locations along the temporal axis. We develop a structural subsampling procedure such that the observations are encoded into multiple sequences of Bernoulli variables. A maximum likelihood approach in conjunction with a newly developed searching algorithm is implemented to detect change points on each Bernoulli process separately. Then, aggregation statistics are proposed to collectively synthesize change-point results from all individual univariate time series into consistent and stable location estimations. We also study a weighting strategy to measure the degree of relevance for different subsampled groups. Simulation studies are conducted and shown that the proposed change-point methodology for multivariate time series has favorable performance comparing with currently available state-of-the-art nonparametric methods under various settings with different degrees of complexity. Real data analyses are finally performed on categorical, ordinal, and continuous time series taken from fields of genetics, climate, and finance.

我们提出了一种非参数变化点检测方法，在不强加相关多元时间序列的先验分布知识的情况下，估计变化点的数量及其沿时间轴的位置。我们开发了一种结构性子采样程序，将观测数据编码为多个伯努利变量序列。最大似然法与新开发的搜索算法相结合，分别检测每个伯努利过程的变化点。然后，我们提出了聚合统计方法，将所有单变量时间序列的变化点结果综合为一致且稳定的位置估计。我们还研究了一种加权策略，用于衡量不同子采样组的相关程度。我们进行了模拟研究，结果表明，在复杂程度不同的各种设置下，针对多变量时间序列提出的变化点方法与目前可用的最先进的非参数方法相比，具有良好的性能。最后对遗传学、气候和金融领域的分类、序数和连续时间序列进行了真实数据分析。

{"title":"An encoding approach for stable change point detection","authors":"Xiaodong Wang, Fushing Hsieh","doi":"10.1007/s10994-023-06510-x","DOIUrl":"https://doi.org/10.1007/s10994-023-06510-x","url":null,"abstract":"Without imposing prior distributional knowledge underlying multivariate time series of interest, we propose a nonparametric change-point detection approach to estimate the number of change points and their locations along the temporal axis. We develop a structural subsampling procedure such that the observations are encoded into multiple sequences of Bernoulli variables. A maximum likelihood approach in conjunction with a newly developed searching algorithm is implemented to detect change points on each Bernoulli process separately. Then, aggregation statistics are proposed to collectively synthesize change-point results from all individual univariate time series into consistent and stable location estimations. We also study a weighting strategy to measure the degree of relevance for different subsampled groups. Simulation studies are conducted and shown that the proposed change-point methodology for multivariate time series has favorable performance comparing with currently available state-of-the-art nonparametric methods under various settings with different degrees of complexity. Real data analyses are finally performed on categorical, ordinal, and continuous time series taken from fields of genetics, climate, and finance.","PeriodicalId":49900,"journal":{"name":"Machine Learning","volume":"630 1","pages":""},"PeriodicalIF":7.5,"publicationDate":"2024-02-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140004657","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Fair and green hyperparameter optimization via multi-objective and multiple information source Bayesian optimization 通过多目标和多信息源贝叶斯优化实现公平和绿色超参数优化

IF 7.5 3区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Machine Learning

Pub Date : 2024-02-28 DOI: 10.1007/s10994-024-06515-0

Abstract

It has been recently remarked that focusing only on accuracy in searching for optimal Machine Learning models amplifies biases contained in the data, leading to unfair predictions and decision supports. Recently, multi-objective hyperparameter optimization has been proposed to search for Machine Learning models which offer equally Pareto-efficient trade-offs between accuracy and fairness. Although these approaches proved to be more versatile than fairness-aware Machine Learning algorithms—which instead optimize accuracy constrained to some threshold on fairness—their carbon footprint could be dramatic, due to the large amount of energy required in the case of large datasets. We propose an approach named FanG-HPO: fair and green hyperparameter optimization (HPO), based on both multi-objective and multiple information source Bayesian optimization. FanG-HPO uses subsets of the large dataset to obtain cheap approximations (aka information sources) of both accuracy and fairness, and multi-objective Bayesian optimization to efficiently identify Pareto-efficient (accurate and fair) Machine Learning models. Experiments consider four benchmark (fairness) datasets and four Machine Learning algorithms, and provide an assessment of FanG-HPO against both fairness-aware Machine Learning approaches and two state-of-the-art Bayesian optimization tools addressing multi-objective and energy-aware optimization.

摘要最近有人指出，在寻找最佳机器学习模型时只关注准确性会放大数据中的偏差，从而导致不公平的预测和决策支持。最近，有人提出了多目标超参数优化方法，以寻找在准确性和公平性之间提供同等帕累托效率权衡的机器学习模型。虽然这些方法被证明比公平感知的机器学习算法更具通用性--这些算法在优化准确性的同时也会限制公平性的某些阈值--但由于在大型数据集的情况下需要消耗大量能源，它们的碳足迹可能会非常惊人。我们提出了一种名为 FanG-HPO 的方法：基于多目标和多信息源贝叶斯优化的公平绿色超参数优化（HPO）。FanG-HPO 利用大型数据集的子集来获得准确性和公平性的廉价近似值（又称信息源），并利用多目标贝叶斯优化来有效识别帕累托效率（准确性和公平性）机器学习模型。实验考虑了四个基准（公平性）数据集和四种机器学习算法，并对照公平性感知机器学习方法和两种解决多目标和能量感知优化问题的最先进贝叶斯优化工具，对 FanG-HPO 进行了评估。

{"title":"Fair and green hyperparameter optimization via multi-objective and multiple information source Bayesian optimization","authors":"","doi":"10.1007/s10994-024-06515-0","DOIUrl":"https://doi.org/10.1007/s10994-024-06515-0","url":null,"abstract":"<h3>Abstract</h3> It has been recently remarked that focusing only on accuracy in searching for optimal Machine Learning models amplifies biases contained in the data, leading to unfair predictions and decision supports. Recently, multi-objective hyperparameter optimization has been proposed to search for Machine Learning models which offer equally Pareto-efficient trade-offs between accuracy and fairness. Although these approaches proved to be more versatile than fairness-aware Machine Learning algorithms—which instead optimize accuracy constrained to some threshold on fairness—their carbon footprint could be dramatic, due to the large amount of energy required in the case of large datasets. We propose an approach named FanG-HPO: fair and green hyperparameter optimization (HPO), based on both multi-objective and multiple information source Bayesian optimization. FanG-HPO uses subsets of the large dataset to obtain cheap approximations (aka information sources) of both accuracy and fairness, and multi-objective Bayesian optimization to efficiently identify Pareto-efficient (accurate and fair) Machine Learning models. Experiments consider four benchmark (fairness) datasets and four Machine Learning algorithms, and provide an assessment of FanG-HPO against both fairness-aware Machine Learning approaches and two state-of-the-art Bayesian optimization tools addressing multi-objective and energy-aware optimization.","PeriodicalId":49900,"journal":{"name":"Machine Learning","volume":"17 1","pages":""},"PeriodicalIF":7.5,"publicationDate":"2024-02-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140004665","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Dynamic datasets and market environments for financial reinforcement learning 金融强化学习的动态数据集和市场环境

IF 7.5 3区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Machine Learning

Pub Date : 2024-02-26 DOI: 10.1007/s10994-023-06511-w

Xiao-Yang Liu, Ziyi Xia, Hongyang Yang, Jiechao Gao, Daochen Zha, Ming Zhu, Christina Dan Wang, Zhaoran Wang, Jian Guo

The financial market is a particularly challenging playground for deep reinforcement learning due to its unique feature of dynamic datasets. Building high-quality market environments for training financial reinforcement learning (FinRL) agents is difficult due to major factors such as the low signal-to-noise ratio of financial data, survivorship bias of historical data, and model overfitting. In this paper, we present an updated version of FinRL-Meta, a data-centric and openly accessible library that processes dynamic datasets from real-world markets into gym-style market environments and has been actively maintained by the AI4Finance community. First, following a DataOps paradigm, we provide hundreds of market environments through an automatic data curation pipeline. Second, we provide homegrown examples and reproduce popular research papers as stepping stones for users to design new trading strategies. We also deploy the library on cloud platforms so that users can visualize their own results and assess the relative performance via community-wise competitions. Third, we provide dozens of Jupyter/Python demos organized into a curriculum and a documentation website to serve the rapidly growing community. The codes are available at https://github.com/AI4Finance-Foundation/FinRL-Meta

由于动态数据集的独特性，金融市场对于深度强化学习来说是一个特别具有挑战性的领域。由于金融数据信噪比低、历史数据的幸存者偏差和模型过拟合等主要因素，为训练金融强化学习（FinRL）代理构建高质量的市场环境非常困难。在本文中，我们介绍了 FinRL-Meta 的更新版本，这是一个以数据为中心、可公开访问的库，可将来自真实市场的动态数据集处理成健身房风格的市场环境，并一直由 AI4Finance 社区积极维护。首先，我们遵循 DataOps 范式，通过自动数据整理管道提供了数百种市场环境。其次，我们提供自制示例并转载热门研究论文，作为用户设计新交易策略的垫脚石。我们还将库部署在云平台上，以便用户可视化自己的结果，并通过社区竞赛评估相对性能。第三，我们提供了数十个 Jupyter/Python 演示，并将其整理成课程和文档网站，为快速发展的社区提供服务。这些代码可在 https://github.com/AI4Finance-Foundation/FinRL-Meta

{"title":"Dynamic datasets and market environments for financial reinforcement learning","authors":"Xiao-Yang Liu, Ziyi Xia, Hongyang Yang, Jiechao Gao, Daochen Zha, Ming Zhu, Christina Dan Wang, Zhaoran Wang, Jian Guo","doi":"10.1007/s10994-023-06511-w","DOIUrl":"https://doi.org/10.1007/s10994-023-06511-w","url":null,"abstract":"The financial market is a particularly challenging playground for deep reinforcement learning due to its unique feature of dynamic datasets. Building high-quality market environments for training financial reinforcement learning (FinRL) agents is difficult due to major factors such as the low signal-to-noise ratio of financial data, survivorship bias of historical data, and model overfitting. In this paper, we present an updated version of FinRL-Meta, a data-centric and openly accessible library that processes dynamic datasets from real-world markets into gym-style market environments and has been actively maintained by the AI4Finance community. First, following a DataOps paradigm, we provide hundreds of market environments through an automatic data curation pipeline. Second, we provide homegrown examples and reproduce popular research papers as stepping stones for users to design new trading strategies. We also deploy the library on cloud platforms so that users can visualize their own results and assess the relative performance via community-wise competitions. Third, we provide dozens of Jupyter/Python demos organized into a curriculum and a documentation website to serve the rapidly growing community. The codes are available at https://github.com/AI4Finance-Foundation/FinRL-Meta","PeriodicalId":49900,"journal":{"name":"Machine Learning","volume":"2015 2","pages":""},"PeriodicalIF":7.5,"publicationDate":"2024-02-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139981410","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Re-attentive experience replay in off-policy reinforcement learning 非政策强化学习中的再注意经验重放

IF 7.5 3区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Machine Learning

Pub Date : 2024-02-22 DOI: 10.1007/s10994-023-06505-8

Wei Wei, Da Wang, Lin Li, Jiye Liang

Experience replay, which stores past samples for reuse, has become a fundamental component of off-policy reinforcement learning. Some pioneering works have indicated that prioritization or reweighting of samples with on-policiness can yield significant performance improvements. However, this method doesn’t pay enough attention to sample diversity, which may result in instability or even long-term performance slumps. In this work, we introduce a novel Re-attention criterion to reevaluate recent experiences, thus benefiting the agent from learning about them. We call this overall algorithm, Re-attentive Experience Replay (RAER). RAER employs a parameter-insensitive dynamic testing technique to enhance the attention of samples generated by policies with promising trends in overall performance. By wisely leveraging diverse samples, RAER fulfills the positive effects of on-policiness while avoiding its potential negative influences. Extensive experiments demonstrate the effectiveness of RAER in improving both performance and stability. Moreover, replacing the on-policiness component of the state-of-the-art approach with RAER can yield significant benefits.

经验重放可以存储过去的样本以供重复使用，已成为非政策强化学习的基本组成部分。一些开创性的工作表明，对具有政策性的样本进行优先排序或重新加权可以显著提高性能。然而，这种方法对样本多样性关注不够，可能会导致性能不稳定甚至长期下滑。在这项工作中，我们引入了一种新颖的 "重新关注"（Re-attention）准则来重新评估最近的经验，从而使代理从学习这些经验中受益。我们将这一整体算法称为 "重新关注经验重放"（RAER）。RAER 采用了对参数不敏感的动态测试技术，以加强对总体性能趋势良好的策略所产生的样本的关注。通过明智地利用各种样本，RAER 在避免政策潜在负面影响的同时，还能发挥政策的积极作用。大量实验证明，RAER 在提高性能和稳定性方面都很有效。此外，用 RAER 取代最先进方法中的 "警戒性 "部分也能产生显著效果。

{"title":"Re-attentive experience replay in off-policy reinforcement learning","authors":"Wei Wei, Da Wang, Lin Li, Jiye Liang","doi":"10.1007/s10994-023-06505-8","DOIUrl":"https://doi.org/10.1007/s10994-023-06505-8","url":null,"abstract":"Experience replay, which stores past samples for reuse, has become a fundamental component of off-policy reinforcement learning. Some pioneering works have indicated that prioritization or reweighting of samples with on-policiness can yield significant performance improvements. However, this method doesn’t pay enough attention to sample diversity, which may result in instability or even long-term performance slumps. In this work, we introduce a novel Re-attention criterion to reevaluate recent experiences, thus benefiting the agent from learning about them. We call this overall algorithm, Re-attentive Experience Replay (RAER). RAER employs a parameter-insensitive dynamic testing technique to enhance the attention of samples generated by policies with promising trends in overall performance. By wisely leveraging diverse samples, RAER fulfills the positive effects of on-policiness while avoiding its potential negative influences. Extensive experiments demonstrate the effectiveness of RAER in improving both performance and stability. Moreover, replacing the on-policiness component of the state-of-the-art approach with RAER can yield significant benefits.","PeriodicalId":49900,"journal":{"name":"Machine Learning","volume":"23 1","pages":""},"PeriodicalIF":7.5,"publicationDate":"2024-02-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139925870","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0