Pareto Optimization for Fair Subset Selection: A Case Study on Personalized Recommendation

IF 11.7 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE IEEE Transactions on Evolutionary Computation Pub Date : 2024-10-29 DOI:10.1109/TEVC.2024.3487624

Lei Zhang;Zhanpeng Wang;Haipeng Yang;Xiang Sun;Fan Cheng

{"title":"Pareto Optimization for Fair Subset Selection: A Case Study on Personalized Recommendation","authors":"Lei Zhang;Zhanpeng Wang;Haipeng Yang;Xiang Sun;Fan Cheng","doi":"10.1109/TEVC.2024.3487624","DOIUrl":null,"url":null,"abstract":"Subset selection is a fundamental problem across a wide range of applications. In this study, we explore scenarios where the variables within the original dataset are divided into distinct groups. Subsequently, we investigate an optimization problem that includes extra fairness constraints (i.e., partition matroid constraints), restricting the selection of a specified number of variables from each group, which is known as the fair subset selection (FSS) problem. First, for the case where the existing Pareto optimization algorithms do not have the ability to well handle the fairness constraints, due to they do not consider fairness constraints in the process of solutions generation. In this article, a Pareto Optimization algorithm named POFSS is proposed for FSS, by introducing a designed fairness balance flip operator. Also, we prove that POFSS has the approximation ability of <inline-formula> <tex-math>$\\max \\{ {}[{\\alpha }/{l}](1-e^{-\\gamma }), {}({\\gamma }/{k}), 1-e^{-r\\overline {k}/k } \\}$ </tex-math></inline-formula> in polynomial time when <inline-formula> <tex-math>$ l \\lt \\log _{2}{n}$ </tex-math></inline-formula> and the probability of mutation is constant <inline-formula> <tex-math>$c/n~(c\\lt 2)$ </tex-math></inline-formula> and this approximation ratio is no worse than the previous theoretical result (<inline-formula> <tex-math>$\\alpha $ </tex-math></inline-formula>, <inline-formula> <tex-math>$\\gamma $ </tex-math></inline-formula> represent two submodular ratios, and l is the number of disjoint groups). In addition, we apply POFSS to one typical FSS task named personalized recommendation (PR), where an acceleration strategy is designed and the acceleration ratio is strictly proved. Finally, the experimental results on the PR task show the proposed POFSS outperforms the state-of-the-art methods in addressing the FSS.","PeriodicalId":13206,"journal":{"name":"IEEE Transactions on Evolutionary Computation","volume":"29 5","pages":"2198-2212"},"PeriodicalIF":11.7000,"publicationDate":"2024-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Evolutionary Computation","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10737706/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Subset selection is a fundamental problem across a wide range of applications. In this study, we explore scenarios where the variables within the original dataset are divided into distinct groups. Subsequently, we investigate an optimization problem that includes extra fairness constraints (i.e., partition matroid constraints), restricting the selection of a specified number of variables from each group, which is known as the fair subset selection (FSS) problem. First, for the case where the existing Pareto optimization algorithms do not have the ability to well handle the fairness constraints, due to they do not consider fairness constraints in the process of solutions generation. In this article, a Pareto Optimization algorithm named POFSS is proposed for FSS, by introducing a designed fairness balance flip operator. Also, we prove that POFSS has the approximation ability of

$\max \{ {}[{\alpha }/{l}](1-e^{-\gamma }), {}({\gamma }/{k}), 1-e^{-r\overline {k}/k } \}$

in polynomial time when

$ l \lt \log _{2}{n}$

and the probability of mutation is constant

$c/n~(c\lt 2)$

and this approximation ratio is no worse than the previous theoretical result (

$\alpha $

$\gamma $

represent two submodular ratios, and l is the number of disjoint groups). In addition, we apply POFSS to one typical FSS task named personalized recommendation (PR), where an acceleration strategy is designed and the acceleration ratio is strictly proved. Finally, the experimental results on the PR task show the proposed POFSS outperforms the state-of-the-art methods in addressing the FSS.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

公平子集选择的帕累托优化：个性化推荐案例研究

子集选择是广泛应用程序中的一个基本问题。在本研究中，我们探索了将原始数据集中的变量划分为不同组的场景。随后，我们研究了一个优化问题，其中包含额外的公平性约束（即分区矩阵约束），限制从每组中选择指定数量的变量，这被称为公平子集选择（FSS）问题。首先，针对现有的Pareto优化算法在解生成过程中没有考虑公平性约束，不能很好地处理公平性约束的情况。本文通过引入一个设计的公平平衡翻转算子，提出了一种用于FSS的Pareto优化算法POFSS。同时，我们证明了当$ l \lt \log _{2}{n}$和突变概率为常数$c/n~(c\lt 2)$时，POFSS在多项式时间内具有$\max \{ {}[{\alpha }/{l}](1-e^{-\gamma }), {}({\gamma }/{k}), 1-e^{-r\overline {k}/k } \}$的近似能力，并且该近似比不差于先前的理论结果（$\alpha $, $\gamma $表示两个亚模比，l为不相交群的个数）。此外，我们将POFSS应用到一个典型的FSS任务个性化推荐（PR）中，设计了加速策略并严格证明了加速比。最后，在PR任务上的实验结果表明，所提出的POFSS在处理FSS方面优于目前最先进的方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

IEEE Transactions on Evolutionary Computation 工程技术-计算机：理论方法

CiteScore

21.90

自引率

9.80%

发文量

196

审稿时长

3.6 months

期刊介绍： The IEEE Transactions on Evolutionary Computation is published by the IEEE Computational Intelligence Society on behalf of 13 societies: Circuits and Systems; Computer; Control Systems; Engineering in Medicine and Biology; Industrial Electronics; Industry Applications; Lasers and Electro-Optics; Oceanic Engineering; Power Engineering; Robotics and Automation; Signal Processing; Social Implications of Technology; and Systems, Man, and Cybernetics. The journal publishes original papers in evolutionary computation and related areas such as nature-inspired algorithms, population-based methods, optimization, and hybrid systems. It welcomes both purely theoretical papers and application papers that provide general insights into these areas of computation.