首页 > 最新文献

Machine learning and knowledge discovery in databases : European Conference, ECML PKDD ... : proceedings. ECML PKDD (Conference)最新文献

英文 中文
Explaining Full-disk Deep Learning Model for Solar Flare Prediction using Attribution Methods 解释利用归因方法预测太阳耀斑的全盘深度学习模型
Chetraj Pandey, R. Angryk, Berkay Aydin
This paper contributes to the growing body of research on deep learning methods for solar flare prediction, primarily focusing on highly overlooked near-limb flares and utilizing the attribution methods to provide a post hoc qualitative explanation of the model's predictions. We present a solar flare prediction model, which is trained using hourly full-disk line-of-sight magnetogram images and employs a binary prediction mode to forecast $geq$M-class flares that may occur within the following 24-hour period. To address the class imbalance, we employ a fusion of data augmentation and class weighting techniques; and evaluate the overall performance of our model using the true skill statistic (TSS) and Heidke skill score (HSS). Moreover, we applied three attribution methods, namely Guided Gradient-weighted Class Activation Mapping, Integrated Gradients, and Deep Shapley Additive Explanations, to interpret and cross-validate our model's predictions with the explanations. Our analysis revealed that full-disk prediction of solar flares aligns with characteristics related to active regions (ARs). In particular, the key findings of this study are: (1) our deep learning models achieved an average TSS=0.51 and HSS=0.35, and the results further demonstrate a competent capability to predict near-limb solar flares and (2) the qualitative analysis of the model explanation indicates that our model identifies and uses features associated with ARs in central and near-limb locations from full-disk magnetograms to make corresponding predictions. In other words, our models learn the shape and texture-based characteristics of flaring ARs even at near-limb areas, which is a novel and critical capability with significant implications for operational forecasting.
本文为太阳耀斑预测的深度学习方法研究做出了贡献,主要关注高度被忽视的近翼耀斑,并利用归因方法为模型预测提供事后定性解释。我们提出了一个太阳耀斑预测模型,该模型使用每小时全盘视距磁图图像进行训练,并采用二元预测模式来预测在接下来的24小时内可能发生的$geq$ m级耀斑。为了解决类不平衡问题,我们采用了数据增强和类加权技术的融合;并使用真实技能统计量(TSS)和海德克技能分数(HSS)来评估我们模型的整体性能。此外,我们应用了三种归因方法,即Guided Gradient-weighted Class Activation Mapping、Integrated Gradients和Deep Shapley Additive explanation,来解释和交叉验证我们模型的预测结果。我们的分析表明,太阳耀斑的全盘预测与活动区(ARs)相关的特征一致。特别是,本研究的主要发现是:(1)我们的深度学习模型实现了平均TSS=0.51和HSS=0.35,结果进一步证明了预测近翼太阳耀斑的能力;(2)对模型解释的定性分析表明,我们的模型识别并利用全盘磁图中与中心和近翼位置的ARs相关的特征进行了相应的预测。换句话说,我们的模型甚至可以在近肢区域学习燃烧ARs的形状和纹理特征,这是一种新颖而关键的能力,对业务预测具有重要意义。
{"title":"Explaining Full-disk Deep Learning Model for Solar Flare Prediction using Attribution Methods","authors":"Chetraj Pandey, R. Angryk, Berkay Aydin","doi":"10.48550/arXiv.2307.15878","DOIUrl":"https://doi.org/10.48550/arXiv.2307.15878","url":null,"abstract":"This paper contributes to the growing body of research on deep learning methods for solar flare prediction, primarily focusing on highly overlooked near-limb flares and utilizing the attribution methods to provide a post hoc qualitative explanation of the model's predictions. We present a solar flare prediction model, which is trained using hourly full-disk line-of-sight magnetogram images and employs a binary prediction mode to forecast $geq$M-class flares that may occur within the following 24-hour period. To address the class imbalance, we employ a fusion of data augmentation and class weighting techniques; and evaluate the overall performance of our model using the true skill statistic (TSS) and Heidke skill score (HSS). Moreover, we applied three attribution methods, namely Guided Gradient-weighted Class Activation Mapping, Integrated Gradients, and Deep Shapley Additive Explanations, to interpret and cross-validate our model's predictions with the explanations. Our analysis revealed that full-disk prediction of solar flares aligns with characteristics related to active regions (ARs). In particular, the key findings of this study are: (1) our deep learning models achieved an average TSS=0.51 and HSS=0.35, and the results further demonstrate a competent capability to predict near-limb solar flares and (2) the qualitative analysis of the model explanation indicates that our model identifies and uses features associated with ARs in central and near-limb locations from full-disk magnetograms to make corresponding predictions. In other words, our models learn the shape and texture-based characteristics of flaring ARs even at near-limb areas, which is a novel and critical capability with significant implications for operational forecasting.","PeriodicalId":74091,"journal":{"name":"Machine learning and knowledge discovery in databases : European Conference, ECML PKDD ... : proceedings. ECML PKDD (Conference)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-07-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"86578292","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Offline Reinforcement Learning with On-Policy Q-Function Regularization 基于策略q函数正则化的离线强化学习
Laixi Shi, Robert Dadashi, Yuejie Chi, P. S. Castro, M. Geist
The core challenge of offline reinforcement learning (RL) is dealing with the (potentially catastrophic) extrapolation error induced by the distribution shift between the history dataset and the desired policy. A large portion of prior work tackles this challenge by implicitly/explicitly regularizing the learning policy towards the behavior policy, which is hard to estimate reliably in practice. In this work, we propose to regularize towards the Q-function of the behavior policy instead of the behavior policy itself, under the premise that the Q-function can be estimated more reliably and easily by a SARSA-style estimate and handles the extrapolation error more straightforwardly. We propose two algorithms taking advantage of the estimated Q-function through regularizations, and demonstrate they exhibit strong performance on the D4RL benchmarks.
离线强化学习(RL)的核心挑战是处理由历史数据集和期望策略之间的分布变化引起的(潜在的灾难性)外推误差。先前的大部分工作通过隐式/显式地将学习策略规范化到行为策略来解决这一挑战,这在实践中很难可靠地估计。在这项工作中,我们提出对行为策略的q函数而不是行为策略本身进行正则化,前提是q函数可以通过sarsa式估计更可靠和容易地估计,并且更直接地处理外推误差。我们通过正则化提出了两种利用估计q函数的算法,并证明它们在D4RL基准测试中表现出强大的性能。
{"title":"Offline Reinforcement Learning with On-Policy Q-Function Regularization","authors":"Laixi Shi, Robert Dadashi, Yuejie Chi, P. S. Castro, M. Geist","doi":"10.48550/arXiv.2307.13824","DOIUrl":"https://doi.org/10.48550/arXiv.2307.13824","url":null,"abstract":"The core challenge of offline reinforcement learning (RL) is dealing with the (potentially catastrophic) extrapolation error induced by the distribution shift between the history dataset and the desired policy. A large portion of prior work tackles this challenge by implicitly/explicitly regularizing the learning policy towards the behavior policy, which is hard to estimate reliably in practice. In this work, we propose to regularize towards the Q-function of the behavior policy instead of the behavior policy itself, under the premise that the Q-function can be estimated more reliably and easily by a SARSA-style estimate and handles the extrapolation error more straightforwardly. We propose two algorithms taking advantage of the estimated Q-function through regularizations, and demonstrate they exhibit strong performance on the D4RL benchmarks.","PeriodicalId":74091,"journal":{"name":"Machine learning and knowledge discovery in databases : European Conference, ECML PKDD ... : proceedings. ECML PKDD (Conference)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"87065147","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Visualizing Overlapping Biclusterings and Boolean Matrix Factorizations 可视化重叠双聚类和布尔矩阵分解
Thibault Marette, Pauli Miettinen, S. Neumann
Finding (bi-)clusters in bipartite graphs is a popular data analysis approach. Analysts typically want to visualize the clusters, which is simple as long as the clusters are disjoint. However, many modern algorithms find overlapping clusters, making visualization more complicated. In this paper, we study the problem of visualizing emph{a given clustering} of overlapping clusters in bipartite graphs and the related problem of visualizing Boolean Matrix Factorizations. We conceptualize three different objectives that any good visualization should satisfy: (1) proximity of cluster elements, (2) large consecutive areas of elements from the same cluster, and (3) large uninterrupted areas in the visualization, regardless of the cluster membership. We provide objective functions that capture these goals and algorithms that optimize these objective functions. Interestingly, in experiments on real-world datasets, we find that the best trade-off between these competing goals is achieved by a novel heuristic, which locally aims to place rows and columns with similar cluster membership next to each other.
在二部图中寻找(双)聚类是一种流行的数据分析方法。分析人员通常希望可视化集群,这很简单,只要集群是不相交的。然而,许多现代算法发现重叠簇,使可视化更加复杂。本文研究了二部图中重叠簇的emph{给定聚类}的可视化问题以及布尔矩阵分解的可视化问题。我们概念化了三个不同的目标,任何良好的可视化都应该满足:(1)集群元素的接近性,(2)来自同一集群的元素的大连续区域,以及(3)可视化中的大不间断区域,无论集群成员如何。我们提供了捕获这些目标的目标函数和优化这些目标函数的算法。有趣的是,在真实世界数据集的实验中,我们发现这些相互竞争的目标之间的最佳权衡是通过一种新颖的启发式方法实现的,该方法在局部目标是将具有相似集群成员的行和列放在彼此旁边。
{"title":"Visualizing Overlapping Biclusterings and Boolean Matrix Factorizations","authors":"Thibault Marette, Pauli Miettinen, S. Neumann","doi":"10.48550/arXiv.2307.07396","DOIUrl":"https://doi.org/10.48550/arXiv.2307.07396","url":null,"abstract":"Finding (bi-)clusters in bipartite graphs is a popular data analysis approach. Analysts typically want to visualize the clusters, which is simple as long as the clusters are disjoint. However, many modern algorithms find overlapping clusters, making visualization more complicated. In this paper, we study the problem of visualizing emph{a given clustering} of overlapping clusters in bipartite graphs and the related problem of visualizing Boolean Matrix Factorizations. We conceptualize three different objectives that any good visualization should satisfy: (1) proximity of cluster elements, (2) large consecutive areas of elements from the same cluster, and (3) large uninterrupted areas in the visualization, regardless of the cluster membership. We provide objective functions that capture these goals and algorithms that optimize these objective functions. Interestingly, in experiments on real-world datasets, we find that the best trade-off between these competing goals is achieved by a novel heuristic, which locally aims to place rows and columns with similar cluster membership next to each other.","PeriodicalId":74091,"journal":{"name":"Machine learning and knowledge discovery in databases : European Conference, ECML PKDD ... : proceedings. ECML PKDD (Conference)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-07-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"83587526","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An Examination of Wearable Sensors and Video Data Capture for Human Exercise Classification 用于人体运动分类的可穿戴传感器和视频数据捕获的研究
Ashish Singh, Antonio Bevilacqua, Timilehin B. Aderinola, Thach Le Nguyen, D. Whelan, M. O'Reilly, B. Caulfield, Georgiana Ifrim
Wearable sensors such as Inertial Measurement Units (IMUs) are often used to assess the performance of human exercise. Common approaches use handcrafted features based on domain expertise or automatically extracted features using time series analysis. Multiple sensors are required to achieve high classification accuracy, which is not very practical. These sensors require calibration and synchronization and may lead to discomfort over longer time periods. Recent work utilizing computer vision techniques has shown similar performance using video, without the need for manual feature engineering, and avoiding some pitfalls such as sensor calibration and placement on the body. In this paper, we compare the performance of IMUs to a video-based approach for human exercise classification on two real-world datasets consisting of Military Press and Rowing exercises. We compare the performance using a single camera that captures video in the frontal view versus using 5 IMUs placed on different parts of the body. We observe that an approach based on a single camera can outperform a single IMU by 10 percentage points on average. Additionally, a minimum of 3 IMUs are required to outperform a single camera. We observe that working with the raw data using multivariate time series classifiers outperforms traditional approaches based on handcrafted or automatically extracted features. Finally, we show that an ensemble model combining the data from a single camera with a single IMU outperforms either data modality. Our work opens up new and more realistic avenues for this application, where a video captured using a readily available smartphone camera, combined with a single sensor, can be used for effective human exercise classification.
惯性测量单元(imu)等可穿戴传感器通常用于评估人体运动的表现。常见的方法是使用基于领域专业知识的手工特征或使用时间序列分析自动提取特征。为了达到较高的分类精度,需要多个传感器,这不是很实用。这些传感器需要校准和同步,并且可能在较长时间内导致不适。最近利用计算机视觉技术的工作已经显示出类似的性能,使用视频,不需要手动特征工程,避免了一些陷阱,如传感器校准和放置在身体上。在本文中,我们将imu的性能与基于视频的人类运动分类方法进行了比较,该方法基于两个真实世界的数据集,包括军事新闻和划船运动。我们比较了使用单个摄像头捕获正面视图视频与使用放置在身体不同部位的5个imu的性能。我们观察到,基于单个相机的方法可以比单个IMU平均高出10个百分点。此外,至少需要3个imu才能胜过单个摄像机。我们观察到,使用多变量时间序列分类器处理原始数据优于基于手工制作或自动提取特征的传统方法。最后,我们证明了将来自单个相机的数据与单个IMU相结合的集成模型优于任何一种数据模式。我们的工作为这一应用开辟了新的、更现实的途径,其中使用现成的智能手机摄像头拍摄的视频,结合单个传感器,可用于有效的人体运动分类。
{"title":"An Examination of Wearable Sensors and Video Data Capture for Human Exercise Classification","authors":"Ashish Singh, Antonio Bevilacqua, Timilehin B. Aderinola, Thach Le Nguyen, D. Whelan, M. O'Reilly, B. Caulfield, Georgiana Ifrim","doi":"10.48550/arXiv.2307.04516","DOIUrl":"https://doi.org/10.48550/arXiv.2307.04516","url":null,"abstract":"Wearable sensors such as Inertial Measurement Units (IMUs) are often used to assess the performance of human exercise. Common approaches use handcrafted features based on domain expertise or automatically extracted features using time series analysis. Multiple sensors are required to achieve high classification accuracy, which is not very practical. These sensors require calibration and synchronization and may lead to discomfort over longer time periods. Recent work utilizing computer vision techniques has shown similar performance using video, without the need for manual feature engineering, and avoiding some pitfalls such as sensor calibration and placement on the body. In this paper, we compare the performance of IMUs to a video-based approach for human exercise classification on two real-world datasets consisting of Military Press and Rowing exercises. We compare the performance using a single camera that captures video in the frontal view versus using 5 IMUs placed on different parts of the body. We observe that an approach based on a single camera can outperform a single IMU by 10 percentage points on average. Additionally, a minimum of 3 IMUs are required to outperform a single camera. We observe that working with the raw data using multivariate time series classifiers outperforms traditional approaches based on handcrafted or automatically extracted features. Finally, we show that an ensemble model combining the data from a single camera with a single IMU outperforms either data modality. Our work opens up new and more realistic avenues for this application, where a video captured using a readily available smartphone camera, combined with a single sensor, can be used for effective human exercise classification.","PeriodicalId":74091,"journal":{"name":"Machine learning and knowledge discovery in databases : European Conference, ECML PKDD ... : proceedings. ECML PKDD (Conference)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"90407831","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Online Network Source Optimization with Graph-Kernel MAB 基于Graph-Kernel MAB的在线网络源优化
L. Toni, P. Frossard
We propose Grab-UCB, a graph-kernel multi-arms bandit algorithm to learn online the optimal source placement in large scale networks, such that the reward obtained from a priori unknown network processes is maximized. The uncertainty calls for online learning, which suffers however from the curse of dimensionality. To achieve sample efficiency, we describe the network processes with an adaptive graph dictionary model, which typically leads to sparse spectral representations. This enables a data-efficient learning framework, whose learning rate scales with the dimension of the spectral representation model instead of the one of the network. We then propose Grab-UCB, an online sequential decision strategy that learns the parameters of the spectral representation while optimizing the action strategy. We derive the performance guarantees that depend on network parameters, which further influence the learning curve of the sequential decision strategy We introduce a computationally simplified solving method, Grab-arm-Light, an algorithm that walks along the edges of the polytope representing the objective function. Simulations results show that the proposed online learning algorithm outperforms baseline offline methods that typically separate the learning phase from the testing one. The results confirm the theoretical findings, and further highlight the gain of the proposed online learning strategy in terms of cumulative regret, sample efficiency and computational complexity.
我们提出了一种图核多臂强盗算法Grab-UCB,用于在线学习大规模网络中的最优源放置,从而使从先验未知网络过程中获得的奖励最大化。这种不确定性要求在线学习,然而,在线学习受到维度的诅咒。为了实现样本效率,我们使用自适应图字典模型来描述网络过程,这通常会导致稀疏的谱表示。这使得一个数据高效的学习框架成为可能,其学习率随谱表示模型的维度而不是网络的维度而变化。然后,我们提出了一种在线顺序决策策略Grab-UCB,该策略在优化动作策略的同时学习频谱表示的参数。我们推导了依赖于网络参数的性能保证,这些参数进一步影响了序列决策策略的学习曲线。我们引入了一种计算简化的求解方法,Grab-arm-Light,一种沿着表示目标函数的多面体边缘行走的算法。仿真结果表明,所提出的在线学习算法优于通常将学习阶段与测试阶段分开的基线离线方法。结果证实了理论发现,并进一步强调了所提出的在线学习策略在累积遗憾、样本效率和计算复杂度方面的收益。
{"title":"Online Network Source Optimization with Graph-Kernel MAB","authors":"L. Toni, P. Frossard","doi":"10.48550/arXiv.2307.03641","DOIUrl":"https://doi.org/10.48550/arXiv.2307.03641","url":null,"abstract":"We propose Grab-UCB, a graph-kernel multi-arms bandit algorithm to learn online the optimal source placement in large scale networks, such that the reward obtained from a priori unknown network processes is maximized. The uncertainty calls for online learning, which suffers however from the curse of dimensionality. To achieve sample efficiency, we describe the network processes with an adaptive graph dictionary model, which typically leads to sparse spectral representations. This enables a data-efficient learning framework, whose learning rate scales with the dimension of the spectral representation model instead of the one of the network. We then propose Grab-UCB, an online sequential decision strategy that learns the parameters of the spectral representation while optimizing the action strategy. We derive the performance guarantees that depend on network parameters, which further influence the learning curve of the sequential decision strategy We introduce a computationally simplified solving method, Grab-arm-Light, an algorithm that walks along the edges of the polytope representing the objective function. Simulations results show that the proposed online learning algorithm outperforms baseline offline methods that typically separate the learning phase from the testing one. The results confirm the theoretical findings, and further highlight the gain of the proposed online learning strategy in terms of cumulative regret, sample efficiency and computational complexity.","PeriodicalId":74091,"journal":{"name":"Machine learning and knowledge discovery in databases : European Conference, ECML PKDD ... : proceedings. ECML PKDD (Conference)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-07-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81569572","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
REAL: A Representative Error-Driven Approach for Active Learning REAL:主动学习的典型错误驱动方法
Cheng Chen, Yong Wang, Lizi Liao, Yueguo Chen, Xiaoyong Du
Given a limited labeling budget, active learning (AL) aims to sample the most informative instances from an unlabeled pool to acquire labels for subsequent model training. To achieve this, AL typically measures the informativeness of unlabeled instances based on uncertainty and diversity. However, it does not consider erroneous instances with their neighborhood error density, which have great potential to improve the model performance. To address this limitation, we propose $REAL$, a novel approach to select data instances with $underline{R}$epresentative $underline{E}$rrors for $underline{A}$ctive $underline{L}$earning. It identifies minority predictions as emph{pseudo errors} within a cluster and allocates an adaptive sampling budget for the cluster based on estimated error density. Extensive experiments on five text classification datasets demonstrate that $REAL$ consistently outperforms all best-performing baselines regarding accuracy and F1-macro scores across a wide range of hyperparameter settings. Our analysis also shows that $REAL$ selects the most representative pseudo errors that match the distribution of ground-truth errors along the decision boundary. Our code is publicly available at https://github.com/withchencheng/ECML_PKDD_23_Real.
在有限的标注预算下,主动学习(AL)旨在从未标注的池中抽取信息量最大的实例,以获取后续模型训练的标签。为了实现这一点,人工智能通常基于不确定性和多样性来度量未标记实例的信息量。然而,它没有考虑错误实例及其邻域误差密度,这对提高模型性能有很大的潜力。为了解决这一限制,我们提出了$REAL$,这是一种新颖的方法,用于选择具有$underline{R}$代表性$underline{E}$错误的数据实例用于$underline{A}$主动$underline{L}$学习。它将少数派预测识别为集群中的emph{伪错误},并根据估计的错误密度为集群分配自适应采样预算。在五个文本分类数据集上进行的大量实验表明,$REAL$在广泛的超参数设置中,在准确性和F1-macro分数方面始终优于所有表现最好的基线。我们的分析还表明,$REAL$选择了最具代表性的伪误差,这些伪误差与沿决策边界的真值误差分布相匹配。我们的代码可以在https://github.com/withchencheng/ECML_PKDD_23_Real上公开获得。
{"title":"REAL: A Representative Error-Driven Approach for Active Learning","authors":"Cheng Chen, Yong Wang, Lizi Liao, Yueguo Chen, Xiaoyong Du","doi":"10.48550/arXiv.2307.00968","DOIUrl":"https://doi.org/10.48550/arXiv.2307.00968","url":null,"abstract":"Given a limited labeling budget, active learning (AL) aims to sample the most informative instances from an unlabeled pool to acquire labels for subsequent model training. To achieve this, AL typically measures the informativeness of unlabeled instances based on uncertainty and diversity. However, it does not consider erroneous instances with their neighborhood error density, which have great potential to improve the model performance. To address this limitation, we propose $REAL$, a novel approach to select data instances with $underline{R}$epresentative $underline{E}$rrors for $underline{A}$ctive $underline{L}$earning. It identifies minority predictions as emph{pseudo errors} within a cluster and allocates an adaptive sampling budget for the cluster based on estimated error density. Extensive experiments on five text classification datasets demonstrate that $REAL$ consistently outperforms all best-performing baselines regarding accuracy and F1-macro scores across a wide range of hyperparameter settings. Our analysis also shows that $REAL$ selects the most representative pseudo errors that match the distribution of ground-truth errors along the decision boundary. Our code is publicly available at https://github.com/withchencheng/ECML_PKDD_23_Real.","PeriodicalId":74091,"journal":{"name":"Machine learning and knowledge discovery in databases : European Conference, ECML PKDD ... : proceedings. ECML PKDD (Conference)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76259226","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Eigensubspace of Temporal-Difference Dynamics and How It Improves Value Approximation in Reinforcement Learning 时间差分动力学的特征子空间及其如何改进强化学习中的值逼近
Qiang He, Tianyi Zhou, Meng Fang, S. Maghsudi
We propose a novel value approximation method, namely Eigensubspace Regularized Critic (ERC) for deep reinforcement learning (RL). ERC is motivated by an analysis of the dynamics of Q-value approximation error in the Temporal-Difference (TD) method, which follows a path defined by the 1-eigensubspace of the transition kernel associated with the Markov Decision Process (MDP). It reveals a fundamental property of TD learning that has remained unused in previous deep RL approaches. In ERC, we propose a regularizer that guides the approximation error tending towards the 1-eigensubspace, resulting in a more efficient and stable path of value approximation. Moreover, we theoretically prove the convergence of the ERC method. Besides, theoretical analysis and experiments demonstrate that ERC effectively reduces the variance of value functions. Among 26 tasks in the DMControl benchmark, ERC outperforms state-of-the-art methods for 20. Besides, it shows significant advantages in Q-value approximation and variance reduction. Our code is available at https://sites.google.com/view/erc-ecml23/.
提出了一种新的用于深度强化学习(RL)的值逼近方法——特征子空间正则化批评家(ERC)。ERC的动机是对时间差分(TD)方法中q值近似误差的动态分析,该方法遵循由与马尔可夫决策过程(MDP)相关的转移核的1特征子空间定义的路径。它揭示了在以前的深度强化学习方法中未使用的TD学习的基本属性。在ERC中,我们提出了一个正则化器,该正则化器引导逼近误差倾向于1特征子空间,从而产生更有效和稳定的值逼近路径。并从理论上证明了ERC方法的收敛性。理论分析和实验表明,ERC有效地减小了值函数的方差。在DMControl基准测试的26个任务中,ERC在20个任务中优于最先进的方法。此外,它在q值逼近和方差减小方面具有显著的优势。我们的代码可在https://sites.google.com/view/erc-ecml23/上获得。
{"title":"Eigensubspace of Temporal-Difference Dynamics and How It Improves Value Approximation in Reinforcement Learning","authors":"Qiang He, Tianyi Zhou, Meng Fang, S. Maghsudi","doi":"10.48550/arXiv.2306.16750","DOIUrl":"https://doi.org/10.48550/arXiv.2306.16750","url":null,"abstract":"We propose a novel value approximation method, namely Eigensubspace Regularized Critic (ERC) for deep reinforcement learning (RL). ERC is motivated by an analysis of the dynamics of Q-value approximation error in the Temporal-Difference (TD) method, which follows a path defined by the 1-eigensubspace of the transition kernel associated with the Markov Decision Process (MDP). It reveals a fundamental property of TD learning that has remained unused in previous deep RL approaches. In ERC, we propose a regularizer that guides the approximation error tending towards the 1-eigensubspace, resulting in a more efficient and stable path of value approximation. Moreover, we theoretically prove the convergence of the ERC method. Besides, theoretical analysis and experiments demonstrate that ERC effectively reduces the variance of value functions. Among 26 tasks in the DMControl benchmark, ERC outperforms state-of-the-art methods for 20. Besides, it shows significant advantages in Q-value approximation and variance reduction. Our code is available at https://sites.google.com/view/erc-ecml23/.","PeriodicalId":74091,"journal":{"name":"Machine learning and knowledge discovery in databases : European Conference, ECML PKDD ... : proceedings. ECML PKDD (Conference)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-06-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82927050","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
DCID: Deep Canonical Information Decomposition DCID:深度规范信息分解
Alexander Rakowski, C. Lippert
We consider the problem of identifying the signal shared between two one-dimensional target variables, in the presence of additional multivariate observations. Canonical Correlation Analysis (CCA)-based methods have traditionally been used to identify shared variables, however, they were designed for multivariate targets and only offer trivial solutions for univariate cases. In the context of Multi-Task Learning (MTL), various models were postulated to learn features that are sparse and shared across multiple tasks. However, these methods were typically evaluated by their predictive performance. To the best of our knowledge, no prior studies systematically evaluated models in terms of correctly recovering the shared signal. Here, we formalize the setting of univariate shared information retrieval, and propose ICM, an evaluation metric which can be used in the presence of ground-truth labels, quantifying 3 aspects of the learned shared features. We further propose Deep Canonical Information Decomposition (DCID) - a simple, yet effective approach for learning the shared variables. We benchmark the models on a range of scenarios on synthetic data with known ground-truths and observe DCID outperforming the baselines in a wide range of settings. Finally, we demonstrate a real-life application of DCID on brain Magnetic Resonance Imaging (MRI) data, where we are able to extract more accurate predictors of changes in brain regions and obesity. The code for our experiments as well as the supplementary materials are available at https://github.com/alexrakowski/dcid
我们考虑在存在额外的多变量观测值的情况下,识别两个一维目标变量之间共享的信号的问题。基于典型相关分析(CCA)的方法传统上用于识别共享变量,然而,它们是为多变量目标设计的,只能为单变量情况提供平凡的解决方案。在多任务学习(MTL)的背景下,假设了各种模型来学习稀疏且跨多个任务共享的特征。然而,这些方法通常是通过其预测性能来评估的。据我们所知,之前没有研究系统地评估了正确恢复共享信号的模型。在这里,我们形式化了单变量共享信息检索的设置,并提出了ICM,一种可以在存在真值标签的情况下使用的评估度量,量化了学习到的共享特征的三个方面。我们进一步提出了深度规范信息分解(DCID)——一种简单而有效的学习共享变量的方法。我们在已知的基本事实的合成数据的一系列场景中对模型进行基准测试,并观察到DCID在广泛的设置中优于基线。最后,我们展示了DCID在脑磁共振成像(MRI)数据上的实际应用,我们能够提取出更准确的脑区域变化和肥胖预测因子。我们的实验代码以及补充材料可以在https://github.com/alexrakowski/dcid上找到
{"title":"DCID: Deep Canonical Information Decomposition","authors":"Alexander Rakowski, C. Lippert","doi":"10.48550/arXiv.2306.15619","DOIUrl":"https://doi.org/10.48550/arXiv.2306.15619","url":null,"abstract":"We consider the problem of identifying the signal shared between two one-dimensional target variables, in the presence of additional multivariate observations. Canonical Correlation Analysis (CCA)-based methods have traditionally been used to identify shared variables, however, they were designed for multivariate targets and only offer trivial solutions for univariate cases. In the context of Multi-Task Learning (MTL), various models were postulated to learn features that are sparse and shared across multiple tasks. However, these methods were typically evaluated by their predictive performance. To the best of our knowledge, no prior studies systematically evaluated models in terms of correctly recovering the shared signal. Here, we formalize the setting of univariate shared information retrieval, and propose ICM, an evaluation metric which can be used in the presence of ground-truth labels, quantifying 3 aspects of the learned shared features. We further propose Deep Canonical Information Decomposition (DCID) - a simple, yet effective approach for learning the shared variables. We benchmark the models on a range of scenarios on synthetic data with known ground-truths and observe DCID outperforming the baselines in a wide range of settings. Finally, we demonstrate a real-life application of DCID on brain Magnetic Resonance Imaging (MRI) data, where we are able to extract more accurate predictors of changes in brain regions and obesity. The code for our experiments as well as the supplementary materials are available at https://github.com/alexrakowski/dcid","PeriodicalId":74091,"journal":{"name":"Machine learning and knowledge discovery in databases : European Conference, ECML PKDD ... : proceedings. ECML PKDD (Conference)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-06-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80895840","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Towards Few-shot Inductive Link Prediction on Knowledge Graphs: A Relational Anonymous Walk-guided Neural Process Approach 面向知识图的少镜头归纳链接预测:一种关系匿名行走引导神经过程方法
Zicheng Zhao, Linhao Luo, Shirui Pan, Quoc Viet Hung Nguyen, Chenggui Gong
Few-shot inductive link prediction on knowledge graphs (KGs) aims to predict missing links for unseen entities with few-shot links observed. Previous methods are limited to transductive scenarios, where entities exist in the knowledge graphs, so they are unable to handle unseen entities. Therefore, recent inductive methods utilize the sub-graphs around unseen entities to obtain the semantics and predict links inductively. However, in the few-shot setting, the sub-graphs are often sparse and cannot provide meaningful inductive patterns. In this paper, we propose a novel relational anonymous walk-guided neural process for few-shot inductive link prediction on knowledge graphs, denoted as RawNP. Specifically, we develop a neural process-based method to model a flexible distribution over link prediction functions. This enables the model to quickly adapt to new entities and estimate the uncertainty when making predictions. To capture general inductive patterns, we present a relational anonymous walk to extract a series of relational motifs from few-shot observations. These motifs reveal the distinctive semantic patterns on KGs that support inductive predictions. Extensive experiments on typical benchmark datasets demonstrate that our model derives new state-of-the-art performance.
基于知识图(KGs)的小片段链接预测旨在通过观察到的小片段链接来预测未知实体的缺失链接。以前的方法仅限于知识图中存在实体的转换场景,因此它们无法处理不可见的实体。因此,最近的归纳方法利用不可见实体周围的子图来获得语义并归纳地预测链接。然而,在少数镜头设置下,子图往往是稀疏的,不能提供有意义的归纳模式。在本文中,我们提出了一种新的关系匿名行走引导神经网络过程,用于知识图上的少量归纳链接预测,称为RawNP。具体来说,我们开发了一种基于神经过程的方法来建模链路预测函数上的灵活分布。这使模型能够快速适应新的实体,并在进行预测时估计不确定性。为了捕获一般的归纳模式,我们提出了一种关系匿名行走,从少量观察中提取一系列关系基序。这些基序揭示了KGs上独特的语义模式,支持归纳预测。在典型基准数据集上进行的大量实验表明,我们的模型获得了新的最先进的性能。
{"title":"Towards Few-shot Inductive Link Prediction on Knowledge Graphs: A Relational Anonymous Walk-guided Neural Process Approach","authors":"Zicheng Zhao, Linhao Luo, Shirui Pan, Quoc Viet Hung Nguyen, Chenggui Gong","doi":"10.48550/arXiv.2307.01204","DOIUrl":"https://doi.org/10.48550/arXiv.2307.01204","url":null,"abstract":"Few-shot inductive link prediction on knowledge graphs (KGs) aims to predict missing links for unseen entities with few-shot links observed. Previous methods are limited to transductive scenarios, where entities exist in the knowledge graphs, so they are unable to handle unseen entities. Therefore, recent inductive methods utilize the sub-graphs around unseen entities to obtain the semantics and predict links inductively. However, in the few-shot setting, the sub-graphs are often sparse and cannot provide meaningful inductive patterns. In this paper, we propose a novel relational anonymous walk-guided neural process for few-shot inductive link prediction on knowledge graphs, denoted as RawNP. Specifically, we develop a neural process-based method to model a flexible distribution over link prediction functions. This enables the model to quickly adapt to new entities and estimate the uncertainty when making predictions. To capture general inductive patterns, we present a relational anonymous walk to extract a series of relational motifs from few-shot observations. These motifs reveal the distinctive semantic patterns on KGs that support inductive predictions. Extensive experiments on typical benchmark datasets demonstrate that our model derives new state-of-the-art performance.","PeriodicalId":74091,"journal":{"name":"Machine learning and knowledge discovery in databases : European Conference, ECML PKDD ... : proceedings. ECML PKDD (Conference)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81449782","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Robust Classification of High-Dimensional Data using Data-Adaptive Energy Distance 基于数据自适应能量距离的高维数据鲁棒分类
Jyotishka Ray Choudhury, Aytijhya Saha, Sarbojit Roy, S. Dutta
Classification of high-dimensional low sample size (HDLSS) data poses a challenge in a variety of real-world situations, such as gene expression studies, cancer research, and medical imaging. This article presents the development and analysis of some classifiers that are specifically designed for HDLSS data. These classifiers are free of tuning parameters and are robust, in the sense that they are devoid of any moment conditions of the underlying data distributions. It is shown that they yield perfect classification in the HDLSS asymptotic regime, under some fairly general conditions. The comparative performance of the proposed classifiers is also investigated. Our theoretical results are supported by extensive simulation studies and real data analysis, which demonstrate promising advantages of the proposed classification techniques over several widely recognized methods.
高维低样本量(HDLSS)数据的分类在各种现实世界的情况下提出了挑战,例如基因表达研究、癌症研究和医学成像。本文介绍了一些专门为HDLSS数据设计的分类器的开发和分析。这些分类器不需要调优参数,并且具有鲁棒性,因为它们没有底层数据分布的任何时刻条件。结果表明,在一些相当一般的条件下,它们在HDLSS渐近状态下产生完美的分类。对所提出的分类器的性能进行了比较研究。我们的理论结果得到了广泛的模拟研究和实际数据分析的支持,这些研究表明,与几种广泛认可的方法相比,所提出的分类技术具有很好的优势。
{"title":"Robust Classification of High-Dimensional Data using Data-Adaptive Energy Distance","authors":"Jyotishka Ray Choudhury, Aytijhya Saha, Sarbojit Roy, S. Dutta","doi":"10.48550/arXiv.2306.13985","DOIUrl":"https://doi.org/10.48550/arXiv.2306.13985","url":null,"abstract":"Classification of high-dimensional low sample size (HDLSS) data poses a challenge in a variety of real-world situations, such as gene expression studies, cancer research, and medical imaging. This article presents the development and analysis of some classifiers that are specifically designed for HDLSS data. These classifiers are free of tuning parameters and are robust, in the sense that they are devoid of any moment conditions of the underlying data distributions. It is shown that they yield perfect classification in the HDLSS asymptotic regime, under some fairly general conditions. The comparative performance of the proposed classifiers is also investigated. Our theoretical results are supported by extensive simulation studies and real data analysis, which demonstrate promising advantages of the proposed classification techniques over several widely recognized methods.","PeriodicalId":74091,"journal":{"name":"Machine learning and knowledge discovery in databases : European Conference, ECML PKDD ... : proceedings. ECML PKDD (Conference)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-06-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80168958","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Machine learning and knowledge discovery in databases : European Conference, ECML PKDD ... : proceedings. ECML PKDD (Conference)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1