A Partition-insensitive Parallel Framework for Distributed Model Fitting

Xiaofei Wu, Rongmei Liang, Fabio Roli, Marcello Pelillo, Jing Yuan
{"title":"A Partition-insensitive Parallel Framework for Distributed Model Fitting","authors":"Xiaofei Wu, Rongmei Liang, Fabio Roli, Marcello Pelillo, Jing Yuan","doi":"arxiv-2406.00703","DOIUrl":null,"url":null,"abstract":"Distributed model fitting refers to the process of fitting a mathematical or\nstatistical model to the data using distributed computing resources, such that\ncomputing tasks are divided among multiple interconnected computers or nodes,\noften organized in a cluster or network. Most of the existing methods for\ndistributed model fitting are to formulate it in a consensus optimization\nproblem, and then build up algorithms based on the alternating direction method\nof multipliers (ADMM). This paper introduces a novel parallel framework for\nachieving a distributed model fitting. In contrast to previous consensus\nframeworks, the introduced parallel framework offers two notable advantages.\nFirstly, it exhibits insensitivity to sample partitioning, meaning that the\nsolution of the algorithm remains unaffected by variations in the number of\nslave nodes or/and the amount of data each node carries. Secondly, fewer\nvariables are required to be updated at each iteration, so that the proposed\nparallel framework performs in a more succinct and efficient way, and adapts to\nhigh-dimensional data. In addition, we prove that the algorithms under the new\nparallel framework have a worst-case linear convergence rate in theory.\nNumerical experiments confirm the generality, robustness, and accuracy of our\nproposed parallel framework.","PeriodicalId":501215,"journal":{"name":"arXiv - STAT - Computation","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-06-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - STAT - Computation","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2406.00703","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Distributed model fitting refers to the process of fitting a mathematical or statistical model to the data using distributed computing resources, such that computing tasks are divided among multiple interconnected computers or nodes, often organized in a cluster or network. Most of the existing methods for distributed model fitting are to formulate it in a consensus optimization problem, and then build up algorithms based on the alternating direction method of multipliers (ADMM). This paper introduces a novel parallel framework for achieving a distributed model fitting. In contrast to previous consensus frameworks, the introduced parallel framework offers two notable advantages. Firstly, it exhibits insensitivity to sample partitioning, meaning that the solution of the algorithm remains unaffected by variations in the number of slave nodes or/and the amount of data each node carries. Secondly, fewer variables are required to be updated at each iteration, so that the proposed parallel framework performs in a more succinct and efficient way, and adapts to high-dimensional data. In addition, we prove that the algorithms under the new parallel framework have a worst-case linear convergence rate in theory. Numerical experiments confirm the generality, robustness, and accuracy of our proposed parallel framework.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
分布式模型拟合的分区敏感并行框架
分布式模型拟合是指利用分布式计算资源对数学或统计模型进行数据拟合的过程,即计算任务在多台相互连接的计算机或节点之间进行分配,通常以集群或网络的形式组织。现有的分布式模型拟合方法大多是将其表述为一个共识优化问题,然后建立基于交替方向乘法(ADMM)的算法。本文介绍了一种实现分布式模型拟合的新型并行框架。与以往的共识框架相比,本文介绍的并行框架有两个显著优势:首先,它对样本分割不敏感,这意味着算法的求解不受从节点数或/和每个节点携带的数据量变化的影响。其次,每次迭代需要更新的变量很少,因此所提出的并行框架能以更简洁、更高效的方式运行,并适应高维数据。此外,我们还证明了新并行框架下的算法在理论上具有最坏情况下的线性收敛率。数值实验证实了我们提出的并行框架的通用性、鲁棒性和准确性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Model-Embedded Gaussian Process Regression for Parameter Estimation in Dynamical System Effects of the entropy source on Monte Carlo simulations A Robust Approach to Gaussian Processes Implementation HJ-sampler: A Bayesian sampler for inverse problems of a stochastic process by leveraging Hamilton-Jacobi PDEs and score-based generative models Reducing Shape-Graph Complexity with Application to Classification of Retinal Blood Vessels and Neurons
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1