Federated $c$-Means and Fuzzy $c$-Means Clustering Algorithms for Horizontally and Vertically Partitioned Data

José Luis Corcuera Bárcena;Francesco Marcelloni;Alessandro Renda;Alessio Bechini;Pietro Ducange
{"title":"Federated $c$-Means and Fuzzy $c$-Means Clustering Algorithms for Horizontally and Vertically Partitioned Data","authors":"José Luis Corcuera Bárcena;Francesco Marcelloni;Alessandro Renda;Alessio Bechini;Pietro Ducange","doi":"10.1109/TAI.2024.3426408","DOIUrl":null,"url":null,"abstract":"Federated clustering lets multiple data owners collaborate in discovering patterns from distributed data without violating privacy requirements. The federated versions of traditional clustering algorithms proposed so far are, however, “lossy” since they fail to identify exactly the same clusters as the original versions executed on the merged data stored in a centralized server, as would happen if no privacy constraint occurred. In this article, we propose federated procedures for losslessly executing the C-means (CM) and the fuzzy C-means (FCM) algorithms in both horizontally and vertically partitioned data scenarios, while preserving data privacy. We formally prove that the proposed federated procedures identify the same clusters determined by applying the algorithms to the union of all local data. Further, we present an extensive experimental analysis for characterizing the behavior of the proposed approach in a typical federated learning scenario, that is, as the fraction of participants in the federation changes. We focus on the federated FCM and the horizontally partitioned data, which is the most interesting scenario. We show that the proposed procedure is effective and is able to achieve competitive performance with respect to two recently proposed versions of federated FCM for horizontally partitioned data.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":"5 12","pages":"6426-6441"},"PeriodicalIF":0.0000,"publicationDate":"2024-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10595840","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE transactions on artificial intelligence","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10595840/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Federated clustering lets multiple data owners collaborate in discovering patterns from distributed data without violating privacy requirements. The federated versions of traditional clustering algorithms proposed so far are, however, “lossy” since they fail to identify exactly the same clusters as the original versions executed on the merged data stored in a centralized server, as would happen if no privacy constraint occurred. In this article, we propose federated procedures for losslessly executing the C-means (CM) and the fuzzy C-means (FCM) algorithms in both horizontally and vertically partitioned data scenarios, while preserving data privacy. We formally prove that the proposed federated procedures identify the same clusters determined by applying the algorithms to the union of all local data. Further, we present an extensive experimental analysis for characterizing the behavior of the proposed approach in a typical federated learning scenario, that is, as the fraction of participants in the federation changes. We focus on the federated FCM and the horizontally partitioned data, which is the most interesting scenario. We show that the proposed procedure is effective and is able to achieve competitive performance with respect to two recently proposed versions of federated FCM for horizontally partitioned data.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
水平和垂直分割数据的联邦$c$-Means和模糊$c$-Means聚类算法
联邦集群允许多个数据所有者协作从分布式数据中发现模式,而不会违反隐私要求。然而,到目前为止提出的传统聚类算法的联邦版本是“有损的”,因为它们无法识别与存储在集中服务器上的合并数据上执行的原始版本完全相同的集群,如果没有隐私约束,就会发生这种情况。在本文中,我们提出了在水平和垂直分区数据场景中无损执行C-means (CM)和模糊C-means (FCM)算法的联邦过程,同时保护数据隐私。我们正式证明了所提出的联邦过程通过将算法应用于所有本地数据的联合来识别相同的聚类。此外,我们提出了一个广泛的实验分析,以描述在典型的联邦学习场景中所提出的方法的行为特征,即随着联邦中参与者的比例的变化。我们主要关注联邦FCM和水平分区数据,这是最有趣的场景。我们证明了所提出的过程是有效的,并且能够相对于最近提出的两个版本的水平分区数据的联邦FCM实现竞争性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
7.70
自引率
0.00%
发文量
0
期刊最新文献
ICAFS: Inter-Client-Aware Feature Selection for Vertical Federated Learning. 2025 Index IEEE Transactions on Artificial Intelligence Table of Contents Front Cover IEEE Transactions on Artificial Intelligence Publication Information
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1