洗牌模型上的隐私保护弗兰克-沃尔夫

IF 0.9 4区数学 Q3 MATHEMATICS, APPLIED Acta Mathematicae Applicatae Sinica, English Series Pub Date : 2024-06-01 DOI:10.1007/s10255-024-1095-6

Ling-jie Zhang, Shi-song Wu, Hai Zhang

{"title":"洗牌模型上的隐私保护弗兰克-沃尔夫","authors":"Ling-jie Zhang, Shi-song Wu, Hai Zhang","doi":"10.1007/s10255-024-1095-6","DOIUrl":null,"url":null,"abstract":"<div>In this paper, we design the differentially private variants of the classical Frank-Wolfe algorithm with shuffle model in the optimization of machine learning. Under weak assumptions and the generalized linear loss (GLL) structure, we propose a noisy Frank-Wolfe with shuffle model algorithm (NoisyFWS) and a noisy variance-reduced Frank-Wolfe with the shuffle model algorithm (NoisyVRFWS) by adding calibrated laplace noise under shuffling scheme in the ℓp(p ∈ [1, 2])-case, and study their privacy as well as utility guarantees for the Hölder smoothness GLL. In particular, the privacy guarantees are mainly achieved by using advanced composition and privacy amplification by shuffling. The utility bounds of the NoisyFWS and NoisyVRFWS are analyzed and obtained the optimal excess population risks \\({\\cal O}({n^{ - {{1 + \\alpha } \\over {4\\alpha }}}} + {{\\log (d)\\sqrt {\\log ({1 \\mathord{\\left/ {\\vphantom {1 \\delta }} \\right.} \\delta })} } \\over {n\\epsilon\\,}})\\) and \\({\\cal O}({n^{ - {{1 + \\alpha } \\over {4\\alpha }}}} + {{\\log (d)\\sqrt {\\log ({1 \\mathord{\\left/ {\\vphantom {1 \\delta }} \\right.} \\delta })} } \\over {{n^2\\epsilon}\\,}})\\) with gradient complexity \\({\\cal O}({n^{ - {{{{(1 + \\alpha )}^2}} \\over {4{\\alpha ^2}}}}})\\) for \\(\\alpha \\in \\left[ {{1 \\mathord{\\left/ {\\vphantom {1 {\\sqrt 3 ,\\,1}}} \\right.} {\\sqrt 3 ,\\,1}}} \\right]\\). It turns out that the risk rates under shuffling scheme are a nearly-dimension independent rate, which is consistent with the previous work in some cases. In addition, there is a vital tradeoff between (α, L)-Hölder smoothness GLL and the gradient complexity. The linear gradient complexity \\({\\cal O}(n)\\) is showed by the parameter α = 1.</div>","PeriodicalId":6951,"journal":{"name":"Acta Mathematicae Applicatae Sinica, English Series","volume":"40 4","pages":"887 - 907"},"PeriodicalIF":0.9000,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Privacy-Preserving Frank-Wolfe on Shuffle Model\",\"authors\":\"Ling-jie Zhang, Shi-song Wu, Hai Zhang\",\"doi\":\"10.1007/s10255-024-1095-6\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div>In this paper, we design the differentially private variants of the classical Frank-Wolfe algorithm with shuffle model in the optimization of machine learning. Under weak assumptions and the generalized linear loss (GLL) structure, we propose a noisy Frank-Wolfe with shuffle model algorithm (NoisyFWS) and a noisy variance-reduced Frank-Wolfe with the shuffle model algorithm (NoisyVRFWS) by adding calibrated laplace noise under shuffling scheme in the ℓp(p ∈ [1, 2])-case, and study their privacy as well as utility guarantees for the Hölder smoothness GLL. In particular, the privacy guarantees are mainly achieved by using advanced composition and privacy amplification by shuffling. The utility bounds of the NoisyFWS and NoisyVRFWS are analyzed and obtained the optimal excess population risks \\\\({\\\\cal O}({n^{ - {{1 + \\\\alpha } \\\\over {4\\\\alpha }}}} + {{\\\\log (d)\\\\sqrt {\\\\log ({1 \\\\mathord{\\\\left/ {\\\\vphantom {1 \\\\delta }} \\\\right.} \\\\delta })} } \\\\over {n\\\\epsilon\\\\,}})\\\\) and \\\\({\\\\cal O}({n^{ - {{1 + \\\\alpha } \\\\over {4\\\\alpha }}}} + {{\\\\log (d)\\\\sqrt {\\\\log ({1 \\\\mathord{\\\\left/ {\\\\vphantom {1 \\\\delta }} \\\\right.} \\\\delta })} } \\\\over {{n^2\\\\epsilon}\\\\,}})\\\\) with gradient complexity \\\\({\\\\cal O}({n^{ - {{{{(1 + \\\\alpha )}^2}} \\\\over {4{\\\\alpha ^2}}}}})\\\\) for \\\\(\\\\alpha \\\\in \\\\left[ {{1 \\\\mathord{\\\\left/ {\\\\vphantom {1 {\\\\sqrt 3 ,\\\\,1}}} \\\\right.} {\\\\sqrt 3 ,\\\\,1}}} \\\\right]\\\\). It turns out that the risk rates under shuffling scheme are a nearly-dimension independent rate, which is consistent with the previous work in some cases. In addition, there is a vital tradeoff between (α, L)-Hölder smoothness GLL and the gradient complexity. The linear gradient complexity \\\\({\\\\cal O}(n)\\\\) is showed by the parameter α = 1.</div>\",\"PeriodicalId\":6951,\"journal\":{\"name\":\"Acta Mathematicae Applicatae Sinica, English Series\",\"volume\":\"40 4\",\"pages\":\"887 - 907\"},\"PeriodicalIF\":0.9000,\"publicationDate\":\"2024-06-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Acta Mathematicae Applicatae Sinica, English Series\",\"FirstCategoryId\":\"100\",\"ListUrlMain\":\"https://link.springer.com/article/10.1007/s10255-024-1095-6\",\"RegionNum\":4,\"RegionCategory\":\"数学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"MATHEMATICS, APPLIED\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Acta Mathematicae Applicatae Sinica, English Series","FirstCategoryId":"100","ListUrlMain":"https://link.springer.com/article/10.1007/s10255-024-1095-6","RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"MATHEMATICS, APPLIED","Score":null,"Total":0}

引用次数: 0

摘要

本文设计了机器学习优化中带有洗牌模型的经典弗兰克-沃尔夫算法的不同私有变体。在弱假设和广义线性损失（GLL）结构下，我们提出了带洗牌模型的噪声弗兰克-沃尔夫算法（NoisyFWS）和在ℓp(p∈ [1, 2])情况下，通过在洗牌方案下添加校准的拉普拉斯噪声而实现的带洗牌模型的噪声方差降低弗兰克-沃尔夫算法（NoisyVRFWS），并研究了它们的隐私性以及霍尔德平滑性 GLL 的效用保证。其中，隐私保证主要是通过使用高级合成和洗牌的隐私放大来实现的。分析了NoisyFWS和NoisyVRFWS的效用边界，并得到了最优过剩人口风险（{\cal O}({n^{ - {{1 + \alpha }\over {4\alpha }}}}+ {{log (d)\sqrt {log ({1 \mathord\{left/ {\vphantom {1 \delta }} \right.} \delta })} }}\over {n\epsilon\,}}) and\({\cal O}({n^{ - {{1 +\alpha })\over {4\alpha }}}}+ {{log (d)sqrt {log ({1 \mathord\{left/ {\vphantom {1 \delta }} \right.} \delta })} }}\over {{n^2\epsilon}\})\) with gradient complexity \({\cal O}({n^{ - {{{{(1 +\alpha )}^2}}\over {4{alpha ^2}}}}})}) for \(\alpha \in \left[ {{1 \mathord{ \left/ { \vphantom {1 { \sqrt 3 ,\,1}}})\right.}{sqrt 3 ,\,1}}\right]\).结果表明，洗牌方案下的风险率是一个几乎与维度无关的比率，这在某些情况下与之前的工作是一致的。此外，(α, L)-荷尔德平滑度 GLL 与梯度复杂度之间存在着重要的权衡。线性梯度复杂度 \({c\al O}(n)\) 由参数 α = 1 表示。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Privacy-Preserving Frank-Wolfe on Shuffle Model

In this paper, we design the differentially private variants of the classical Frank-Wolfe algorithm with shuffle model in the optimization of machine learning. Under weak assumptions and the generalized linear loss (GLL) structure, we propose a noisy Frank-Wolfe with shuffle model algorithm (NoisyFWS) and a noisy variance-reduced Frank-Wolfe with the shuffle model algorithm (NoisyVRFWS) by adding calibrated laplace noise under shuffling scheme in the ℓ_p(p ∈ [1, 2])-case, and study their privacy as well as utility guarantees for the Hölder smoothness GLL. In particular, the privacy guarantees are mainly achieved by using advanced composition and privacy amplification by shuffling. The utility bounds of the NoisyFWS and NoisyVRFWS are analyzed and obtained the optimal excess population risks \({\cal O}({n^{ - {{1 + \alpha } \over {4\alpha }}}} + {{\log (d)\sqrt {\log ({1 \mathord{\left/ {\vphantom {1 \delta }} \right.} \delta })} } \over {n\epsilon\,}})\) and \({\cal O}({n^{ - {{1 + \alpha } \over {4\alpha }}}} + {{\log (d)\sqrt {\log ({1 \mathord{\left/ {\vphantom {1 \delta }} \right.} \delta })} } \over {{n^2\epsilon}\,}})\) with gradient complexity \({\cal O}({n^{ - {{{{(1 + \alpha )}^2}} \over {4{\alpha ^2}}}}})\) for \(\alpha \in \left[ {{1 \mathord{\left/ {\vphantom {1 {\sqrt 3 ,\,1}}} \right.} {\sqrt 3 ,\,1}}} \right]\). It turns out that the risk rates under shuffling scheme are a nearly-dimension independent rate, which is consistent with the previous work in some cases. In addition, there is a vital tradeoff between (α, L)-Hölder smoothness GLL and the gradient complexity. The linear gradient complexity \({\cal O}(n)\) is showed by the parameter α = 1.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Acta Mathematicae Applicatae Sinica, English Series 数学-应用数学

CiteScore

1.30

自引率

0.00%

发文量

审稿时长

3.0 months

期刊介绍： Acta Mathematicae Applicatae Sinica (English Series) is a quarterly journal established by the Chinese Mathematical Society. The journal publishes high quality research papers from all branches of applied mathematics, and particularly welcomes those from partial differential equations, computational mathematics, applied probability, mathematical finance, statistics, dynamical systems, optimization and management science.