{"title":"为差异化私人成对学习设定更敏锐的超额风险边界","authors":"Yilin Kang , Jian Li , Yong Liu , Weiping Wang","doi":"10.1016/j.neucom.2024.128610","DOIUrl":null,"url":null,"abstract":"<div><div>Pairwise learning is a vital part of machine learning. It depends on pairs of training instances, and is naturally fit for modeling relationships between samples. However, as a data driven paradigm, it faces huge privacy issues. Differential privacy (DP) is a useful tool to protect the privacy of machine learning, but corresponding excess population risk bounds are loose in existing DP pairwise learning analysis. In this paper, we propose a gradient perturbation algorithm for pairwise learning to get better risk bounds under Polyak–Łojasiewicz condition, including both convex and non-convex cases. Specifically, for the theoretical risk bound in expectation, previous best results are of rates <span><math><mrow><mi>O</mi><mrow><mo>(</mo><mrow><mfrac><mrow><mi>p</mi></mrow><mrow><msup><mrow><mi>n</mi></mrow><mrow><mn>2</mn></mrow></msup><msup><mrow><mi>ϵ</mi></mrow><mrow><mn>2</mn></mrow></msup></mrow></mfrac><mo>+</mo><mfrac><mrow><mn>1</mn></mrow><mrow><mi>n</mi></mrow></mfrac></mrow><mo>)</mo></mrow></mrow></math></span> and <span><math><mrow><mi>O</mi><mrow><mo>(</mo><mrow><mfrac><mrow><msqrt><mrow><mi>p</mi></mrow></msqrt></mrow><mrow><mi>n</mi><mi>ϵ</mi></mrow></mfrac><mo>+</mo><mfrac><mrow><mn>1</mn></mrow><mrow><msqrt><mrow><mi>n</mi></mrow></msqrt></mrow></mfrac></mrow><mo>)</mo></mrow></mrow></math></span> under strongly convex condition and convex conditions, respectively. In this paper, we use the <em>on-average stability</em> and achieve an <span><math><mrow><mi>O</mi><mrow><mo>(</mo><mrow><mo>min</mo><mrow><mo>{</mo><mrow><mfrac><mrow><msqrt><mrow><mi>p</mi></mrow></msqrt></mrow><mrow><msup><mrow><mi>n</mi></mrow><mrow><mn>1</mn><mo>.</mo><mn>5</mn></mrow></msup><mi>ϵ</mi></mrow></mfrac><mo>+</mo><mfrac><mrow><msup><mrow><mi>p</mi></mrow><mrow><mn>1</mn><mo>.</mo><mn>5</mn></mrow></msup></mrow><mrow><msup><mrow><mi>n</mi></mrow><mrow><mn>2</mn><mo>.</mo><mn>5</mn></mrow></msup><msup><mrow><mi>ϵ</mi></mrow><mrow><mn>3</mn></mrow></msup></mrow></mfrac><mo>,</mo><mfrac><mrow><mi>p</mi></mrow><mrow><msup><mrow><mi>n</mi></mrow><mrow><mn>2</mn></mrow></msup><msup><mrow><mi>ϵ</mi></mrow><mrow><mn>2</mn></mrow></msup></mrow></mfrac><mo>+</mo><mfrac><mrow><mn>1</mn></mrow><mrow><mi>n</mi></mrow></mfrac></mrow><mo>}</mo></mrow></mrow><mo>)</mo></mrow></mrow></math></span> bound, significantly improving previous bounds. For the high probability risk bound, previous best results are analyzed by the uniform stability, and <span><math><mrow><mi>O</mi><mrow><mo>(</mo><mrow><msubsup><mrow><mi>β</mi></mrow><mrow><mi>n</mi></mrow><mrow><mi>U</mi></mrow></msubsup><mo>+</mo><mfrac><mrow><msqrt><mrow><mi>p</mi></mrow></msqrt></mrow><mrow><msqrt><mrow><mi>n</mi></mrow></msqrt><mi>ϵ</mi></mrow></mfrac></mrow><mo>)</mo></mrow></mrow></math></span> excess population risk bounds are achieved under strongly convex or convex conditions, where <span><math><msubsup><mrow><mi>β</mi></mrow><mrow><mi>n</mi></mrow><mrow><mi>U</mi></mrow></msubsup></math></span> is the traditional pairwise uniform stability parameter, it is large since it considers the worst case of the loss sensitivity. In this paper, we propose the <em>pairwise locally elastic stability</em> and improve the high probability bound to <span><math><mrow><mi>O</mi><mrow><mo>(</mo><mrow><mfrac><mrow><msub><mrow><mi>β</mi></mrow><mrow><mi>E</mi></mrow></msub></mrow><mrow><msqrt><mrow><mi>n</mi></mrow></msqrt></mrow></mfrac><mo>+</mo><mfrac><mrow><msqrt><mrow><mi>p</mi></mrow></msqrt></mrow><mrow><msqrt><mrow><mi>n</mi></mrow></msqrt><mi>ϵ</mi></mrow></mfrac></mrow><mo>)</mo></mrow></mrow></math></span>, in which the pairwise locally elastic stability parameter <span><math><mrow><msub><mrow><mi>β</mi></mrow><mrow><mi>E</mi></mrow></msub><mo>≪</mo><msubsup><mrow><mi>β</mi></mrow><mrow><mi>n</mi></mrow><mrow><mi>U</mi></mrow></msubsup></mrow></math></span> because it considers the average sensitivity of the pairwise loss function.</div></div>","PeriodicalId":19268,"journal":{"name":"Neurocomputing","volume":"610 ","pages":"Article 128610"},"PeriodicalIF":6.7000,"publicationDate":"2024-12-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Towards sharper excess risk bounds for differentially private pairwise learning\",\"authors\":\"Yilin Kang , Jian Li , Yong Liu , Weiping Wang\",\"doi\":\"10.1016/j.neucom.2024.128610\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Pairwise learning is a vital part of machine learning. It depends on pairs of training instances, and is naturally fit for modeling relationships between samples. However, as a data driven paradigm, it faces huge privacy issues. Differential privacy (DP) is a useful tool to protect the privacy of machine learning, but corresponding excess population risk bounds are loose in existing DP pairwise learning analysis. In this paper, we propose a gradient perturbation algorithm for pairwise learning to get better risk bounds under Polyak–Łojasiewicz condition, including both convex and non-convex cases. Specifically, for the theoretical risk bound in expectation, previous best results are of rates <span><math><mrow><mi>O</mi><mrow><mo>(</mo><mrow><mfrac><mrow><mi>p</mi></mrow><mrow><msup><mrow><mi>n</mi></mrow><mrow><mn>2</mn></mrow></msup><msup><mrow><mi>ϵ</mi></mrow><mrow><mn>2</mn></mrow></msup></mrow></mfrac><mo>+</mo><mfrac><mrow><mn>1</mn></mrow><mrow><mi>n</mi></mrow></mfrac></mrow><mo>)</mo></mrow></mrow></math></span> and <span><math><mrow><mi>O</mi><mrow><mo>(</mo><mrow><mfrac><mrow><msqrt><mrow><mi>p</mi></mrow></msqrt></mrow><mrow><mi>n</mi><mi>ϵ</mi></mrow></mfrac><mo>+</mo><mfrac><mrow><mn>1</mn></mrow><mrow><msqrt><mrow><mi>n</mi></mrow></msqrt></mrow></mfrac></mrow><mo>)</mo></mrow></mrow></math></span> under strongly convex condition and convex conditions, respectively. In this paper, we use the <em>on-average stability</em> and achieve an <span><math><mrow><mi>O</mi><mrow><mo>(</mo><mrow><mo>min</mo><mrow><mo>{</mo><mrow><mfrac><mrow><msqrt><mrow><mi>p</mi></mrow></msqrt></mrow><mrow><msup><mrow><mi>n</mi></mrow><mrow><mn>1</mn><mo>.</mo><mn>5</mn></mrow></msup><mi>ϵ</mi></mrow></mfrac><mo>+</mo><mfrac><mrow><msup><mrow><mi>p</mi></mrow><mrow><mn>1</mn><mo>.</mo><mn>5</mn></mrow></msup></mrow><mrow><msup><mrow><mi>n</mi></mrow><mrow><mn>2</mn><mo>.</mo><mn>5</mn></mrow></msup><msup><mrow><mi>ϵ</mi></mrow><mrow><mn>3</mn></mrow></msup></mrow></mfrac><mo>,</mo><mfrac><mrow><mi>p</mi></mrow><mrow><msup><mrow><mi>n</mi></mrow><mrow><mn>2</mn></mrow></msup><msup><mrow><mi>ϵ</mi></mrow><mrow><mn>2</mn></mrow></msup></mrow></mfrac><mo>+</mo><mfrac><mrow><mn>1</mn></mrow><mrow><mi>n</mi></mrow></mfrac></mrow><mo>}</mo></mrow></mrow><mo>)</mo></mrow></mrow></math></span> bound, significantly improving previous bounds. For the high probability risk bound, previous best results are analyzed by the uniform stability, and <span><math><mrow><mi>O</mi><mrow><mo>(</mo><mrow><msubsup><mrow><mi>β</mi></mrow><mrow><mi>n</mi></mrow><mrow><mi>U</mi></mrow></msubsup><mo>+</mo><mfrac><mrow><msqrt><mrow><mi>p</mi></mrow></msqrt></mrow><mrow><msqrt><mrow><mi>n</mi></mrow></msqrt><mi>ϵ</mi></mrow></mfrac></mrow><mo>)</mo></mrow></mrow></math></span> excess population risk bounds are achieved under strongly convex or convex conditions, where <span><math><msubsup><mrow><mi>β</mi></mrow><mrow><mi>n</mi></mrow><mrow><mi>U</mi></mrow></msubsup></math></span> is the traditional pairwise uniform stability parameter, it is large since it considers the worst case of the loss sensitivity. In this paper, we propose the <em>pairwise locally elastic stability</em> and improve the high probability bound to <span><math><mrow><mi>O</mi><mrow><mo>(</mo><mrow><mfrac><mrow><msub><mrow><mi>β</mi></mrow><mrow><mi>E</mi></mrow></msub></mrow><mrow><msqrt><mrow><mi>n</mi></mrow></msqrt></mrow></mfrac><mo>+</mo><mfrac><mrow><msqrt><mrow><mi>p</mi></mrow></msqrt></mrow><mrow><msqrt><mrow><mi>n</mi></mrow></msqrt><mi>ϵ</mi></mrow></mfrac></mrow><mo>)</mo></mrow></mrow></math></span>, in which the pairwise locally elastic stability parameter <span><math><mrow><msub><mrow><mi>β</mi></mrow><mrow><mi>E</mi></mrow></msub><mo>≪</mo><msubsup><mrow><mi>β</mi></mrow><mrow><mi>n</mi></mrow><mrow><mi>U</mi></mrow></msubsup></mrow></math></span> because it considers the average sensitivity of the pairwise loss function.</div></div>\",\"PeriodicalId\":19268,\"journal\":{\"name\":\"Neurocomputing\",\"volume\":\"610 \",\"pages\":\"Article 128610\"},\"PeriodicalIF\":6.7000,\"publicationDate\":\"2024-12-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Neurocomputing\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S092523122401381X\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2024/9/17 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Neurocomputing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S092523122401381X","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/9/17 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
Towards sharper excess risk bounds for differentially private pairwise learning
Pairwise learning is a vital part of machine learning. It depends on pairs of training instances, and is naturally fit for modeling relationships between samples. However, as a data driven paradigm, it faces huge privacy issues. Differential privacy (DP) is a useful tool to protect the privacy of machine learning, but corresponding excess population risk bounds are loose in existing DP pairwise learning analysis. In this paper, we propose a gradient perturbation algorithm for pairwise learning to get better risk bounds under Polyak–Łojasiewicz condition, including both convex and non-convex cases. Specifically, for the theoretical risk bound in expectation, previous best results are of rates and under strongly convex condition and convex conditions, respectively. In this paper, we use the on-average stability and achieve an bound, significantly improving previous bounds. For the high probability risk bound, previous best results are analyzed by the uniform stability, and excess population risk bounds are achieved under strongly convex or convex conditions, where is the traditional pairwise uniform stability parameter, it is large since it considers the worst case of the loss sensitivity. In this paper, we propose the pairwise locally elastic stability and improve the high probability bound to , in which the pairwise locally elastic stability parameter because it considers the average sensitivity of the pairwise loss function.
期刊介绍:
Neurocomputing publishes articles describing recent fundamental contributions in the field of neurocomputing. Neurocomputing theory, practice and applications are the essential topics being covered.