Lei Zhang;Zhanpeng Wang;Haipeng Yang;Xiang Sun;Fan Cheng
{"title":"Pareto Optimization for Fair Subset Selection: A Case Study on Personalized Recommendation","authors":"Lei Zhang;Zhanpeng Wang;Haipeng Yang;Xiang Sun;Fan Cheng","doi":"10.1109/TEVC.2024.3487624","DOIUrl":null,"url":null,"abstract":"Subset selection is a fundamental problem across a wide range of applications. In this study, we explore scenarios where the variables within the original dataset are divided into distinct groups. Subsequently, we investigate an optimization problem that includes extra fairness constraints (i.e., partition matroid constraints), restricting the selection of a specified number of variables from each group, which is known as the fair subset selection (FSS) problem. First, for the case where the existing Pareto optimization algorithms do not have the ability to well handle the fairness constraints, due to they do not consider fairness constraints in the process of solutions generation. In this article, a Pareto Optimization algorithm named POFSS is proposed for FSS, by introducing a designed fairness balance flip operator. Also, we prove that POFSS has the approximation ability of <inline-formula> <tex-math>$\\max \\{ {}[{\\alpha }/{l}](1-e^{-\\gamma }), {}({\\gamma }/{k}), 1-e^{-r\\overline {k}/k } \\}$ </tex-math></inline-formula> in polynomial time when <inline-formula> <tex-math>$ l \\lt \\log _{2}{n}$ </tex-math></inline-formula> and the probability of mutation is constant <inline-formula> <tex-math>$c/n~(c\\lt 2)$ </tex-math></inline-formula> and this approximation ratio is no worse than the previous theoretical result (<inline-formula> <tex-math>$\\alpha $ </tex-math></inline-formula>, <inline-formula> <tex-math>$\\gamma $ </tex-math></inline-formula> represent two submodular ratios, and l is the number of disjoint groups). In addition, we apply POFSS to one typical FSS task named personalized recommendation (PR), where an acceleration strategy is designed and the acceleration ratio is strictly proved. Finally, the experimental results on the PR task show the proposed POFSS outperforms the state-of-the-art methods in addressing the FSS.","PeriodicalId":13206,"journal":{"name":"IEEE Transactions on Evolutionary Computation","volume":"29 5","pages":"2198-2212"},"PeriodicalIF":11.7000,"publicationDate":"2024-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Evolutionary Computation","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10737706/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Subset selection is a fundamental problem across a wide range of applications. In this study, we explore scenarios where the variables within the original dataset are divided into distinct groups. Subsequently, we investigate an optimization problem that includes extra fairness constraints (i.e., partition matroid constraints), restricting the selection of a specified number of variables from each group, which is known as the fair subset selection (FSS) problem. First, for the case where the existing Pareto optimization algorithms do not have the ability to well handle the fairness constraints, due to they do not consider fairness constraints in the process of solutions generation. In this article, a Pareto Optimization algorithm named POFSS is proposed for FSS, by introducing a designed fairness balance flip operator. Also, we prove that POFSS has the approximation ability of $\max \{ {}[{\alpha }/{l}](1-e^{-\gamma }), {}({\gamma }/{k}), 1-e^{-r\overline {k}/k } \}$ in polynomial time when $ l \lt \log _{2}{n}$ and the probability of mutation is constant $c/n~(c\lt 2)$ and this approximation ratio is no worse than the previous theoretical result ($\alpha $ , $\gamma $ represent two submodular ratios, and l is the number of disjoint groups). In addition, we apply POFSS to one typical FSS task named personalized recommendation (PR), where an acceleration strategy is designed and the acceleration ratio is strictly proved. Finally, the experimental results on the PR task show the proposed POFSS outperforms the state-of-the-art methods in addressing the FSS.
期刊介绍:
The IEEE Transactions on Evolutionary Computation is published by the IEEE Computational Intelligence Society on behalf of 13 societies: Circuits and Systems; Computer; Control Systems; Engineering in Medicine and Biology; Industrial Electronics; Industry Applications; Lasers and Electro-Optics; Oceanic Engineering; Power Engineering; Robotics and Automation; Signal Processing; Social Implications of Technology; and Systems, Man, and Cybernetics. The journal publishes original papers in evolutionary computation and related areas such as nature-inspired algorithms, population-based methods, optimization, and hybrid systems. It welcomes both purely theoretical papers and application papers that provide general insights into these areas of computation.