Automated Cluster Elimination Guided by High-Density Points

IF 10.5 1区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS IEEE Transactions on Cybernetics Pub Date : 2025-02-21 DOI:10.1109/TCYB.2025.3537108
Xianghui Hu;Yichuan Jiang;Witold Pedrycz;Zhaohong Deng;Jianwei Gao;Yiming Tang
{"title":"Automated Cluster Elimination Guided by High-Density Points","authors":"Xianghui Hu;Yichuan Jiang;Witold Pedrycz;Zhaohong Deng;Jianwei Gao;Yiming Tang","doi":"10.1109/TCYB.2025.3537108","DOIUrl":null,"url":null,"abstract":"Determining the optimal number of clusters in cluster analysis without prior knowledge remains a critical and challenging task. Existing methods often depend on calculating clustering validity indices (CVIs), which increases complexity and may reduce efficiency. Furthermore, different CVIs frequently suggest varying optimal cluster numbers, complicating the selection process. To address these challenges, we propose a novel clustering algorithm, self-regulating possibilistic C-means (PCM) with high-density points (SR-PCM-HDP), which simplifies cluster number determination while improving clustering efficiency. First, the density-based knowledge extraction (DBKE) method is introduced to estimate an appropriate initial cluster number and identify high-density points. DBKE enhances the density peak clustering (DPC) algorithm by removing the need for a predefined density radius. Second, SR-PCM-HDP refines the clustering process by incorporating a parameter to balance the interactions between high-density points and cluster centers, reducing sensitivity to initial configurations and accelerating convergence. Third, the parameter adjustment mechanism in classical PCM is redefined to enable adaptive updates during SR-PCM-HDP iterations. This mechanism facilitates the gradual elimination of obsolete clusters and iterative cluster formation. The theoretical foundations of the SR-PCM-HDP cluster elimination mechanism are rigorously established. Experimental results validate the accuracy and effectiveness of SR-PCM-HDP in determining cluster numbers and ensuring clustering validity, particularly for datasets with overlapping or imbalanced distributions. Comparisons are conducted against 13 state-of-the-art algorithms, including fuzzy clustering, possibilistic clustering, and CVI-based cluster determination methods.","PeriodicalId":13112,"journal":{"name":"IEEE Transactions on Cybernetics","volume":"55 4","pages":"1717-1730"},"PeriodicalIF":10.5000,"publicationDate":"2025-02-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Cybernetics","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10899193/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0

Abstract

Determining the optimal number of clusters in cluster analysis without prior knowledge remains a critical and challenging task. Existing methods often depend on calculating clustering validity indices (CVIs), which increases complexity and may reduce efficiency. Furthermore, different CVIs frequently suggest varying optimal cluster numbers, complicating the selection process. To address these challenges, we propose a novel clustering algorithm, self-regulating possibilistic C-means (PCM) with high-density points (SR-PCM-HDP), which simplifies cluster number determination while improving clustering efficiency. First, the density-based knowledge extraction (DBKE) method is introduced to estimate an appropriate initial cluster number and identify high-density points. DBKE enhances the density peak clustering (DPC) algorithm by removing the need for a predefined density radius. Second, SR-PCM-HDP refines the clustering process by incorporating a parameter to balance the interactions between high-density points and cluster centers, reducing sensitivity to initial configurations and accelerating convergence. Third, the parameter adjustment mechanism in classical PCM is redefined to enable adaptive updates during SR-PCM-HDP iterations. This mechanism facilitates the gradual elimination of obsolete clusters and iterative cluster formation. The theoretical foundations of the SR-PCM-HDP cluster elimination mechanism are rigorously established. Experimental results validate the accuracy and effectiveness of SR-PCM-HDP in determining cluster numbers and ensuring clustering validity, particularly for datasets with overlapping or imbalanced distributions. Comparisons are conducted against 13 state-of-the-art algorithms, including fuzzy clustering, possibilistic clustering, and CVI-based cluster determination methods.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
高密度点引导的自动聚类消除
在无先验知识的聚类分析中确定最优聚类数仍然是一项关键而具有挑战性的任务。现有方法往往依赖于计算聚类有效性指标,这增加了算法的复杂性,降低了效率。此外,不同的CVIs通常建议不同的最优聚类数,使选择过程复杂化。为了解决这些问题,我们提出了一种新的聚类算法——高密度点自调节可能性c均值(PCM)算法(SR-PCM-HDP),该算法在简化聚类数确定的同时提高了聚类效率。首先,引入基于密度的知识提取(DBKE)方法,估计合适的初始聚类数,识别高密度点;DBKE通过消除对预定义密度半径的需要来增强密度峰值聚类(DPC)算法。其次,SR-PCM-HDP通过引入一个参数来平衡高密度点和聚类中心之间的相互作用,从而改进聚类过程,降低对初始配置的敏感性,加速收敛。第三,重新定义经典PCM中的参数调整机制,使SR-PCM-HDP迭代过程中的参数自适应更新成为可能。这种机制有利于逐步淘汰过时的集群和迭代形成集群。建立了SR-PCM-HDP簇消除机制的理论基础。实验结果验证了SR-PCM-HDP在确定聚类数和确保聚类有效性方面的准确性和有效性,特别是对于分布重叠或不平衡的数据集。比较了13种最先进的算法,包括模糊聚类、可能性聚类和基于cvi的聚类确定方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
IEEE Transactions on Cybernetics
IEEE Transactions on Cybernetics COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE-COMPUTER SCIENCE, CYBERNETICS
CiteScore
25.40
自引率
11.00%
发文量
1869
期刊介绍: The scope of the IEEE Transactions on Cybernetics includes computational approaches to the field of cybernetics. Specifically, the transactions welcomes papers on communication and control across machines or machine, human, and organizations. The scope includes such areas as computational intelligence, computer vision, neural networks, genetic algorithms, machine learning, fuzzy systems, cognitive systems, decision making, and robotics, to the extent that they contribute to the theme of cybernetics or demonstrate an application of cybernetics principles.
期刊最新文献
Guiding Multiagent Multitask Reinforcement Learning by a Hierarchical Framework With Logical Reward Shaping. Accelerated Energy-Saving Learning Control for Stochastic Point-to-Point Tracking Systems. An Improved Quadratic Function Negative Definiteness Lemma for the Stabilization of Nonlinear Cyber-Physical Systems With Actuator Faults. A Predefined-Time Robust Neural Dynamics Controller for Projective Synchronization of Second-Order Chaotic Systems and Its Application. Zentropy-Enhanced Multigranularity Knowledge Modeling for Robust Feature Selection.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1