Selecting Central and Divergent Samples via Leading Tree Metric Space for Semisupervised Learning

IF 11.9 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE IEEE Transactions on Fuzzy Systems Pub Date : 2025-01-13 DOI:10.1109/TFUZZ.2025.3528400
Ji Xu;Gang Ren;Jianhang Tang;Weiping Ding;Guoyin Wang
{"title":"Selecting Central and Divergent Samples via Leading Tree Metric Space for Semisupervised Learning","authors":"Ji Xu;Gang Ren;Jianhang Tang;Weiping Ding;Guoyin Wang","doi":"10.1109/TFUZZ.2025.3528400","DOIUrl":null,"url":null,"abstract":"The distribution of the labeled data can greatly affect the performance of a semisupervised learning (SSL) model. Most existing SSL models select the labeled data randomly and equally allocate the labeling quota among the classes, leading to considerable unstableness and degeneration of performance. This study unsupervisedly constructs a leading forest that forms another metric space, based on which it is convenient to define the fuzzy membership function to characterize central and divergent samples and select both types with fuzzy Xor logic. The labeling quota can, thus, be allocated adaptively among different classes. The proposed determinate labeling strategy can generally improve the performance for most SSLs. Especially, when combined with the kernelized large margin component analysis, it produces a novel semisupervised classification model. In addition, the multimodal issue in SSL is effectively addressed by the multigranular structure of leading forest that readily facilitates multiple local metrics learning. Extensive experimental results demonstrate that the proposed method achieved competitive efficiency and encouraging accuracy when compared with the state-of-the-art methods.","PeriodicalId":13212,"journal":{"name":"IEEE Transactions on Fuzzy Systems","volume":"33 5","pages":"1578-1591"},"PeriodicalIF":11.9000,"publicationDate":"2025-01-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Fuzzy Systems","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10839088/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

Abstract

The distribution of the labeled data can greatly affect the performance of a semisupervised learning (SSL) model. Most existing SSL models select the labeled data randomly and equally allocate the labeling quota among the classes, leading to considerable unstableness and degeneration of performance. This study unsupervisedly constructs a leading forest that forms another metric space, based on which it is convenient to define the fuzzy membership function to characterize central and divergent samples and select both types with fuzzy Xor logic. The labeling quota can, thus, be allocated adaptively among different classes. The proposed determinate labeling strategy can generally improve the performance for most SSLs. Especially, when combined with the kernelized large margin component analysis, it produces a novel semisupervised classification model. In addition, the multimodal issue in SSL is effectively addressed by the multigranular structure of leading forest that readily facilitates multiple local metrics learning. Extensive experimental results demonstrate that the proposed method achieved competitive efficiency and encouraging accuracy when compared with the state-of-the-art methods.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于领先树度量空间的半监督学习中心和发散样本选择
标记数据的分布会极大地影响半监督学习(SSL)模型的性能。现有的SSL模型大多随机选择标记数据,并在类之间平均分配标记配额,这导致了相当大的不稳定性和性能下降。本研究无监督地构造了一个先导森林,该先导森林形成了另一个度量空间,在此基础上可以方便地定义模糊隶属函数来表征中心样本和发散样本,并利用模糊异或逻辑选择这两种类型。因此,标签配额可以在不同的类别之间自适应地分配。所提出的确定性标记策略通常可以提高大多数ssl的性能。特别地,当与核化大余量成分分析相结合时,产生了一种新的半监督分类模型。此外,领先森林的多颗粒结构可以有效地解决SSL中的多模态问题,从而容易地促进多个局部度量学习。大量的实验结果表明,与现有的方法相比,该方法具有竞争力的效率和令人鼓舞的准确性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
IEEE Transactions on Fuzzy Systems
IEEE Transactions on Fuzzy Systems 工程技术-工程:电子与电气
CiteScore
20.50
自引率
13.40%
发文量
517
审稿时长
3.0 months
期刊介绍: The IEEE Transactions on Fuzzy Systems is a scholarly journal that focuses on the theory, design, and application of fuzzy systems. It aims to publish high-quality technical papers that contribute significant technical knowledge and exploratory developments in the field of fuzzy systems. The journal particularly emphasizes engineering systems and scientific applications. In addition to research articles, the Transactions also includes a letters section featuring current information, comments, and rebuttals related to published papers.
期刊最新文献
Information Granule-Based Time Series Prediction via Synergizing Multiscale and Multitype Information Granulations A Fuzzy Large TabNet-Based Model and its Distillation Learning for Noisy-Labeled Data Via Consequent Additive Decomposition Efficient Fuzzy Model Predictive Tracking Control of Nonlinear Systems: A Membership Function Approximation Approach Vision-Language-Action Model-Based Event-Triggered Admittance Control of a Mobile Manipulator for Power Substation Live-Maintaining Automatic programming via large language models with population self-evolution for dynamic fuzzy job shop scheduling problem
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1