Learning Data Representations with Joint Diffusion Models
Pub Date: 2023-01-31 | DOI: 10.48550/arXiv.2301.13622 | Pages: 543-559
K. Deja, T. Trzciński, Jakub M. Tomczak
Joint machine learning models that can both synthesize and classify data often offer uneven performance across the two tasks or are unstable to train. In this work, we start from a set of empirical observations indicating that the internal representations built by contemporary deep diffusion-based generative models are useful not only for generation but also for prediction. We then propose to extend the vanilla diffusion model with a classifier, which allows for stable, end-to-end joint training with a parameterization shared between the two objectives. The resulting joint diffusion model outperforms recent state-of-the-art hybrid methods in terms of both classification and generation quality on all evaluated benchmarks. Building on our joint training approach, we show how to directly benefit from the shared generative and discriminative representations by introducing a method for visual counterfactual explanations.
{"title":"Learning Data Representations with Joint Diffusion Models","authors":"K. Deja, T. Trzciński, Jakub M. Tomczak","doi":"10.48550/arXiv.2301.13622","DOIUrl":"https://doi.org/10.48550/arXiv.2301.13622","url":null,"abstract":"Joint machine learning models that allow synthesizing and classifying data often offer uneven performance between those tasks or are unstable to train. In this work, we depart from a set of empirical observations that indicate the usefulness of internal representations built by contemporary deep diffusion-based generative models not only for generating but also predicting. We then propose to extend the vanilla diffusion model with a classifier that allows for stable joint end-to-end training with shared parameterization between those objectives. The resulting joint diffusion model outperforms recent state-of-the-art hybrid methods in terms of both classification and generation quality on all evaluated benchmarks. On top of our joint training approach, we present how we can directly benefit from shared generative and discriminative representations by introducing a method for visual counterfactual explanations.","PeriodicalId":74091,"journal":{"name":"Machine learning and knowledge discovery in databases : European Conference, ECML PKDD ... : proceedings. ECML PKDD (Conference)","volume":"94 16 1","pages":"543-559"},"PeriodicalIF":0.0,"publicationDate":"2023-01-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"83511593","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Voting from Nearest Tasks: Meta-Vote Pruning of Pre-trained Models for Downstream Tasks
Pub Date: 2023-01-27 | DOI: 10.48550/arXiv.2301.11560 | Pages: 52-68
Haiyan Zhao, Tianyi Zhou, Guodong Long, Jing Jiang, Chengqi Zhang
As a few large-scale pre-trained models become the default choice for a wide range of applications, new challenges arise for model pruning, e.g., can we avoid pruning the same model from scratch for every downstream task, and how can we reuse the pruning results of previous tasks to accelerate pruning for a new one? To address these challenges, we create a small model for a new task from the pruned models of similar tasks. We show that a few fine-tuning steps on this model suffice to produce a promising pruned model for the new task. We study this "meta-pruning" from nearest tasks on two major classes of pre-trained models, convolutional neural networks (CNNs) and vision transformers (ViTs), under a limited budget of pruning iterations. Our study begins by investigating the overlap between pruned models of similar tasks and how this overlap changes across layers and blocks. Inspired by these observations, we develop a simple but effective "Meta-Vote Pruning (MVP)" method that significantly reduces the pruning iterations for a new task by initializing a sub-network from the pruned models of its nearest tasks. In experiments, we demonstrate MVP's advantages in accuracy, efficiency, and generalization through extensive empirical studies and comparisons with popular pruning methods over several datasets.
Understanding Difficulty-based Sample Weighting with a Universal Difficulty Measure
Pub Date: 2023-01-12 | DOI: 10.48550/arXiv.2301.04850 | Pages: 68-84
Xiaoling Zhou, Ou Wu, Weiyao Zhu, Ziyang Liang
Sample weighting is widely used in deep learning. A large number of weighting methods essentially use the learning difficulty of training samples to compute their weights; in this study we call this scheme difficulty-based weighting. Two important issues arise when explaining this scheme. First, there is no unified difficulty measure for training samples that comes with theoretical guarantees: the learning difficulty of a sample is determined by multiple factors, including noise level, imbalance degree, margin, and uncertainty, whereas existing measures consider only a single factor, or several factors in part, rather than all of them. Second, a comprehensive theoretical explanation of why difficulty-based weighting is effective in deep learning is lacking. In this study, we theoretically prove that the generalization error of a sample can be used as a universal difficulty measure. Furthermore, we provide formal theoretical justifications for the role of difficulty-based weighting in deep learning, revealing its positive influence on both the optimization dynamics and the generalization performance of deep models, which is instructive for existing weighting schemes.
{"title":"Understanding Difficulty-based Sample Weighting with a Universal Difficulty Measure","authors":"Xiaoling Zhou, Ou Wu, Weiyao Zhu, Ziyang Liang","doi":"10.48550/arXiv.2301.04850","DOIUrl":"https://doi.org/10.48550/arXiv.2301.04850","url":null,"abstract":"Sample weighting is widely used in deep learning. A large number of weighting methods essentially utilize the learning difficulty of training samples to calculate their weights. In this study, this scheme is called difficulty-based weighting. Two important issues arise when explaining this scheme. First, a unified difficulty measure that can be theoretically guaranteed for training samples does not exist. The learning difficulties of the samples are determined by multiple factors including noise level, imbalance degree, margin, and uncertainty. Nevertheless, existing measures only consider a single factor or in part, but not in their entirety. Second, a comprehensive theoretical explanation is lacking with respect to demonstrating why difficulty-based weighting schemes are effective in deep learning. In this study, we theoretically prove that the generalization error of a sample can be used as a universal difficulty measure. Furthermore, we provide formal theoretical justifications on the role of difficulty-based weighting for deep learning, consequently revealing its positive influences on both the optimization dynamics and generalization performance of deep models, which is instructive to existing weighting schemes.","PeriodicalId":74091,"journal":{"name":"Machine learning and knowledge discovery in databases : European Conference, ECML PKDD ... : proceedings. ECML PKDD (Conference)","volume":"24 1","pages":"68-84"},"PeriodicalIF":0.0,"publicationDate":"2023-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"86083458","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Machine Learning and Knowledge Discovery in Databases: European Conference, ECML PKDD 2022, Grenoble, France, September 19–23, 2022, Proceedings, Part I
Pub Date: 2023-01-01 | DOI: 10.1007/978-3-031-26387-3
Machine Learning and Knowledge Discovery in Databases: European Conference, ECML PKDD 2022, Grenoble, France, September 19–23, 2022, Proceedings, Part IV
Pub Date: 2023-01-01 | DOI: 10.1007/978-3-031-26412-2
Machine Learning and Knowledge Discovery in Databases: European Conference, ECML PKDD 2022, Grenoble, France, September 19–23, 2022, Proceedings, Part III
Pub Date: 2023-01-01 | DOI: 10.1007/978-3-031-26409-2
Machine Learning and Knowledge Discovery in Databases: European Conference, ECML PKDD 2022, Grenoble, France, September 19–23, 2022, Proceedings, Part VI
Pub Date: 2023-01-01 | DOI: 10.1007/978-3-031-26422-1
Machine Learning and Knowledge Discovery in Databases: European Conference, ECML PKDD 2022, Grenoble, France, September 19–23, 2022, Proceedings, Part V
Pub Date: 2023-01-01 | DOI: 10.1007/978-3-031-26419-1
Differentially Private Bayesian Neural Networks on Accuracy, Privacy and Reliability
Pub Date: 2023-01-01 | DOI: 10.1007/978-3-031-26412-2_37 | Volume 13716, Pages: 604-619
Open-access PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10438902/pdf/nihms-1861884.pdf
Qiyiwen Zhang, Zhiqi Bu, Kan Chen, Qi Long
Bayesian neural networks (BNNs) allow for uncertainty quantification in prediction, offering an advantage over regular neural networks that has not been explored in the differential privacy (DP) framework. We fill this important gap by leveraging recent developments in Bayesian deep learning and privacy accounting to offer a more precise analysis of the trade-off between privacy and accuracy in BNNs. We propose three DP-BNNs that characterize the weight uncertainty for the same network architecture in distinct ways, namely DP-SGLD (via the noisy gradient method), DP-BBP (via changing the parameters of interest), and DP-MC Dropout (via the model architecture). Interestingly, we show a new equivalence between DP-SGD and DP-SGLD, implying that some non-Bayesian DP training naturally allows for uncertainty quantification. However, hyperparameters such as the learning rate and batch size can have different or even opposite effects in DP-SGD and DP-SGLD. Extensive experiments are conducted to compare the DP-BNNs in terms of privacy guarantee, prediction accuracy, uncertainty quantification, calibration, computation speed, and generalizability across network architectures. As a result, we observe a new trade-off between privacy and reliability. Compared to non-DP and non-Bayesian approaches, DP-SGLD is remarkably accurate under a strong privacy guarantee, demonstrating the great potential of DP-BNNs in real-world tasks.
{"title":"Differentially Private Bayesian Neural Networks on Accuracy, Privacy and Reliability.","authors":"Qiyiwen Zhang, Zhiqi Bu, Kan Chen, Qi Long","doi":"10.1007/978-3-031-26412-2_37","DOIUrl":"https://doi.org/10.1007/978-3-031-26412-2_37","url":null,"abstract":"<p><p>Bayesian neural network (BNN) allows for uncertainty quantification in prediction, offering an advantage over regular neural networks that has not been explored in the differential privacy (DP) framework. We fill this important gap by leveraging recent development in Bayesian deep learning and privacy accounting to offer a more precise analysis of the trade-off between privacy and accuracy in BNN. We propose three DP-BNNs that characterize the weight uncertainty for the same network architecture in distinct ways, namely DP-SGLD (via the noisy gradient method), DP-BBP (via changing the parameters of interest) and DP-MC Dropout (via the model architecture). Interestingly, we show a new equivalence between DP-SGD and DP-SGLD, implying that some non-Bayesian DP training naturally allows for uncertainty quantification. However, the hyperparameters such as learning rate and batch size, can have different or even opposite effects in DP-SGD and DP-SGLD. Extensive experiments are conducted to compare DP-BNNs, in terms of privacy guarantee, prediction accuracy, uncertainty quantification, calibration, computation speed, and generalizability to network architecture. As a result, we observe a new tradeoff between the privacy and the reliability. When compared to non-DP and non-Bayesian approaches, DP-SGLD is remarkably accurate under strong privacy guarantee, demonstrating the great potential of DP-BNN in real-world tasks.</p>","PeriodicalId":74091,"journal":{"name":"Machine learning and knowledge discovery in databases : European Conference, ECML PKDD ... : proceedings. ECML PKDD (Conference)","volume":"13716 ","pages":"604-619"},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10438902/pdf/nihms-1861884.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10040859","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Machine Learning and Knowledge Discovery in Databases: European Conference, ECML PKDD 2022, Grenoble, France, September 19–23, 2022, Proceedings, Part II
Pub Date: 2023-01-01 | DOI: 10.1007/978-3-031-26390-3